uofr_edited.jpg

HELLO, I'M

Tony Geng

universityofrochesterlogo_1487157387000_17343118_ver1.0.webp

Assistant Professor @ University of Rochester

tonggeng_photo.jpg

Tony Geng

Assistant Professor

Department of ECE

Department of Computer Science

University of Rochester

tgeng(at)ur(dot)rochester(dot)edu

  • g
  • universityofrochesterlogo_1487157387000_17343118_ver1.0
 

About Tony

MY BACKGROUND

Dr. Tony Geng is a tenure-track assistant professor in the ECE and CS departments of the University of Rochester (UR). Before joining Rochester, Tony worked in the Physical & Computational Sciences Directorate (PCSD) at Pacific Northwest National Laboratory (PNNL) operated by the Department of Energy of the US government for 2 years. He received his Ph.D. in Computer Engineering at Boston University in 2020. His research interests are at the intersection of computer architecture & systems, machine learning, graph intelligence, and high-performance computing. He is the recipient of the Outstanding Postdoc Award at PNNL, and META's faculty award on AI System Hardware/Software Codesign. Tony's papers have appeared in many prestigious conferences and journals including MICRO, HPCA, AAAI, DAC, SC, TPDS, TC, etc.

To prospective students:

I am always looking for Postdoc, Ph.D. students, and Interns (remote is acceptable) to work on next-generation hardware architectures & systems for AI, graph intelligence, ML, and their applications including Fintech, Recommendation systems, Social media, and Smart city. Please drop me an email with your CV and transcripts if you are interested.

RESEARCH INTERESTS

Computer Architecture: GPU, FPGA, CGRA, Accelerators for AI, Quantum Computer, Future Heterogeneity in Hardware and System 

Machine Learning: Spatio-temporal Graph Neural Networks, Broadly-defined Graph Intelligence, DNNs

Applications: Fintech, Social Media, Recommendation System, Smart City, Public Health, Supply Chain
 

Selected Publications

​   2022:

  • [AAAI 2022] Z.Pan, A.Sharma, J.Hu, Z.liu, A.Li, H.Liu, M.Huang, T.Geng: "Ising-Traffic: An Ising-based Framework for Traffic Congestion Prediction with Uncertainty", Thirty-Seventh AAAI Conference on Artificial Intelligence.

  • [TPDS 2022] W.Sun, A.Li, T.Geng, S.Stuijk, H.Corporaal: "Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numerical Behaviors", IEEE Transactions on Parallel and Distributed Systems.

  • [HPCA 2022] H.You*, T.Geng*, Y.Zhang, A.Li, Y.Lin: "GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design", The 28th IEEE International Symposium on HighPerformance Computer Architecture.

  • [HPCA 2022] C.Tan, N.B.Agostini, T.Geng, C.Xie, J.Li, A.Li, K.Barker, A.Tumeo: "DRIPS: Dynamic Rebalancing of Pipelined Streaming Applications on CGRAs", The 28th IEEE International Symposium on High-Performance Computer Architecture.

  • [DAC 2022] H. Peng, ..., T.Geng, ..., C.Ding: "A Length Adaptive Algorithm-Hardware Co-design of Transformer on FPGA Through Sparse Attention and Dynamic Pipelining", The 58th Design Automation Conference.

  • [ICS 2022] C.Zhang, S.Jin, T.Geng, J.Tian, A.Li, D.Tao: "Accelerating Parallel I/O Via Hardware-Algorithm Co-Designed Adaptive Lossy Compression", the 36th ACM International Conference on Supercomputing.

  • [ICS 2022] C.Tan, T.Tembe, J.Zhang, B.Fang, T.Geng, G.Wei, D.Brooks, A.Tumeo, G.Gopalakrishnan A.Li: "ASAP - Automatic Synthesis of Area-Efficient and  Precision-Aware CGRA", the 36th ACM International Conference on Supercomputing.

   2021:

  • [MICRO 2021] T.Geng, C.Wu, ..., M.Herbordt, Y.Lin, A.Li: "I-GCN: A Graph Convolutional Network Accelerator with Runtime Locality Enhancement through Islandization", the 54th IEEE/ACM International Symposium on Microarchitecture.

  • [TPDS 2021] T.Geng, T.Wang, C.Wu, Y.Li, ..., A.Li, M.Herbordt: "O3BNN-R: An Out-Of-Order Architecture for HighPerformance and Regularized BNN inference", IEEE Transactions on Parallel and Distributed Systems.

  • [TPDS 2021] C.Tan, C.Xie, T.Geng, ..., K.Barker, A.Li: "ARENA: Asynchronous Reconfigurable Accelerator Ring to Enable Data-Centric Parallel Computing", IEEE Transactions on Parallel and Distributed Systems.

  • [SC 2021] B.Feng, Y.Wang, T.Geng, A.Li, Y.Ding: "APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores", Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis.

  • [ICCAD 2021] Y.Zhang, H.You, Y.Fu, T.Geng, A.Li, Y.Lin: "G-CoS: GNN-Accelerator Co-Search Towards Both Better Accuracy and Efficiency", 2021 International Conference On Computer Aided Design.

  • [ICCD 2021] C.Tan, T.Geng, C.Xie, N.Agostini, J.Li, A.Li, K.Barker, A.Tumeo: "DynPaC: Coarse-Grained, Dynamic, and Partially Reconfigurable Array for Streaming Applications", the 39th IEEE International Conference on Computer Design. (Best Paper Award)

   2020:

   2019:

  • [ICS 2019] T.Geng, T.Wang, C.Wu, C.Yang, W.Wu, A.Li, M.Herbordt: "O3BNN: An Out-Of-Order Architecture for High-Performance Binarized Neural Network Inference with Fine-Grained Pruning", the 33th ACM International Conference on Supercomputing.

  • [SC 2019] A.Li, T.Geng, T.Wang, M.Herbordt, S.Song, K.Barker: "BSTC: A Novel BinarizedSoft-Tensor-Core Design for Accelerating Bit-Based Approximated Neural Nets", Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis.

  • [SC 2019] C.Yang, T.Geng, T.Wang, ..., M.Herbordt: "Fully integrated FPGA molecular dynamics simulations", Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis.

 

Projects

Graph-Neural-Networks.webp
ai-circuit-board-technology-system-scaled_edited.jpg

Graph

Intelligence

 View Selected Papers for More Details:

I-GCN [MICRO 2021],

GCoD [HPCA 2022],

GCoS [ICCAD 2021],

AWB-GCN [MICRO 2020]

Ising-Traffic [AAAI 2023]

DNN Inference & Training

 View Selected Papers for More Details:

FPDeep [TC 2020]

O3BNN-R[TPDS 2021],

Tensor-Core ANN[SC2021],

BSTC[SC 2019],

O3BNN[ICS 2019]

CGRA

Architecture

 View Selected Papers for More Details:

DRIPS[HPCA2021], DynPaC[ICCD2021],

ASAP[ICS2022],

CQNN[HPEC 2020]

preço-de-gpu-pakistao_edited.png
ai-circuit-board-technology-system-scaled_edited.jpg

GPU

Architecture

 View Selected Papers for More Details:

AP-TC [SC2021], 

 BSTC [SC 2019]

Sparse TC[TPDS2022]

dominoes-2364492_1920_edited.png

NLP: RNN & Transformer

 View Selected Papers for More Details:

Co-design Transformer [DAC 2022], 

CSB-RNN [ICS 2020],

FPGA Transformer[ISQED 2021]

Drug Discovery --

Molecular Dynamics

 View Selected Papers for More Details:

FPGA-based MD

[ICCAD 2021], [SC 2019], [ASAP 2019],[FCCM2021], [HPEC2021], [FCCM2022]

Future Heterogeneity in Hardware and Systems

 View Selected Papers for More Details:

ARENA[TPDS2021],

In-NIC Data Compression[ICS2022],

FCsN[ FCCM 2022],

Smart Switch

[CPE 2021], [HPEC 2021], [FCCM 2020], [FPT 2020]

 

Meet The Team

Image from iOS.jpg

Zhenyu Pan

Research Interests:

1. Ising Graph Learning;

2. Quantum-inspired GNN;

3. Domain-specific Acceleration with FPGAs;

Zhuo Liu

Research Interests:

1. GNN Acceleration;

2. Heterogeneous System;

3. Ising Machine;

4. FPGAs;

WeChat Image_20220629182830_edited_edited.jpg

Yinghao Wu

Research Interests:

1. Spatial-Temporal GNN

2. Ising-aided GNN

3. ML Acceleration

4. Computer Architecture

WeChat Image_20220629143309_edited.jpg

Banksy Luo

Research Interests:

1. GPUs and Reconfigurability in GPUs;

2. Graph Learning on Heterogeneous System;

lz1_edited.jpg
 
 

News

11/2022 One paper was accepted by AAAI 2023 -- Congrats to the leading author, Zhenyu Pan, for publishing in his first Ph.D. semester.

11/2022 One paper was accepted by LoG 2023 -- Learning on Graphs (LoG) Conference -- a very decent new conference, strongly recommend

10/2022 One paper was accepted by TPDS 2022.

09/2022 Prof. Tony Geng received Meta (Facebook) Faculty Research Award on AI System Hardware/Software Codesign.

09/2022 Four papers were accepted by ICCD 2022.

09/2022 Our proposal was selected as an internationally excellent finalist in Meta (Facebook) RFP - Networking for AI.

06/2022 Three papers were accepted by FPL 2022.

04/2022 Two papers were accepted by ICS 2022.

02/2022 One paper was accepted by DAC 2022.

02/2022 Tony gave an invited talk at Northwestern University.

02/2022 Tony gave an invited talk at the University of Rochester.

12/2021 Two papers were accepted by HPCA 2022.