uofr_edited.jpg

HELLO, I'M

Tony Geng

universityofrochesterlogo_1487157387000_17343118_ver1.0.webp

Assistant Professor @ University of Rochester

tonggeng_photo.jpg

Tony Geng

Assistant Professor

Department of ECE

Department of Computer Science

University of Rochester

tgeng(at)ur(dot)rochester(dot)edu

  • g
  • universityofrochesterlogo_1487157387000_17343118_ver1.0
 

About Tony

MY BACKGROUND

Dr. Tony Geng is a tenure-track assistant professor in the ECE and CS departments of the University of Rochester (UR). Before joining Rochester, Tony worked in the Physical & Computational Sciences Directorate (PCSD) at Pacific Northwest National Laboratory (PNNL) operated by the Department of Energy of the US government for 2 years. He received his Ph.D. in Computer Engineering at Boston University in 2020. His research interests are at the intersection of computer architecture & systems, machine learning, graph intelligence, and high-performance computing. He is the finalist for the Best Thesis Award of BU and the recipient of the Outstanding Postdoc Award at PNNL. Tony has published over 50 papers and his papers have appeared in many prestigious conferences and journals including MICRO, HPCA, DAC, SC, TPDS, TC, ICS, etc.

To prospective students:

I am always looking for Postdoc, Ph.D. students, and Interns (remote is acceptable) to work on next-generation hardware architectures & systems for AI, graph intelligence, ML, and their applications including Fintech, Recommendation systems, Social media, and Smart city. Please drop me an email with your CV and transcripts if you are interested.

RESEARCH INTERESTS

Computer Architecture: GPU, FPGA, CGRA, Accelerators for AI, Quantum Computer, Future Heterogeneity in Hardware and System 

Machine Learning: Spatio-temporal Graph Neural Networks, Broadly-defined Graph Intelligence, DNNs

Applications: Fintech, Social Media, Recommendation System, Smart City, Public Health, Supply Chain
 

Selected Publications

​   2022:

  • [HPCA 2022] H.You*, T.Geng*, Y.Zhang, A.Li, Y.Lin: "GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design", The 28th IEEE International Symposium on HighPerformance Computer Architecture.

  • [HPCA 2022] C.Tan, N.B.Agostini, T.Geng, C.Xie, J.Li, A.Li, K.Barker, A.Tumeo: "DRIPS: Dynamic Rebalancing of Pipelined Streaming Applications on CGRAs", The 28th IEEE International Symposium on High-Performance Computer Architecture.

  • [DAC 2022] H. Peng, ..., T.Geng, ..., C.Ding: "A Length Adaptive Algorithm-Hardware Co-design of Transformer on FPGA Through Sparse Attention and Dynamic Pipelining", The 58th Design Automation Conference.

  • [ICS 2022] C.Zhang, S.Jin, T.Geng, J.Tian, A.Li, D.Tao: "Accelerating Parallel I/O Via Hardware-Algorithm Co-Designed Adaptive Lossy Compression", the 36th ACM International Conference on Supercomputing.

  • [ICS 2022] C.Tan, T.Tembe, J.Zhang, B.Fang, T.Geng, G.Wei, D.Brooks, A.Tumeo, G.Gopalakrishnan A.Li: "ASAP - Automatic Synthesis of Area-Efficient and  Precision-Aware CGRA", the 36th ACM International Conference on Supercomputing.

   2021:

  • [MICRO 2021] T.Geng, C.Wu, ..., M.Herbordt, Y.Lin, A.Li: "I-GCN: A Graph Convolutional Network Accelerator with Runtime Locality Enhancement through Islandization", the 54th IEEE/ACM International Symposium on Microarchitecture.

  • [TPDS 2021] T.Geng, T.Wang, C.Wu, Y.Li, ..., A.Li, M.Herbordt: "O3BNN-R: An Out-Of-Order Architecture for HighPerformance and Regularized BNN inference", IEEE Transactions on Parallel and Distributed Systems.

  • [TPDS 2021] C.Tan, C.Xie, T.Geng, ..., K.Barker, A.Li: "ARENA: Asynchronous Reconfigurable Accelerator Ring to Enable Data-Centric Parallel Computing", IEEE Transactions on Parallel and Distributed Systems.

  • [SC 2021] B.Feng, Y.Wang, T.Geng, A.Li, Y.Ding: "APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores", Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis.

  • [ICCAD 2021] Y.Zhang, H.You, Y.Fu, T.Geng, A.Li, Y.Lin: "G-CoS: GNN-Accelerator Co-Search Towards Both Better Accuracy and Efficiency", 2021 International Conference On Computer Aided Design.

  • [ICCAD 2021] D.Manu, ..., T.Geng, A.Li, C.Ding, W.Jiang, L.Yang: "BFL-DISCO: Federated Generative Adversarial Network for Graph-based Molecule Drug Discovery", 2021 International Conference On Computer Aided Design.

  • [ICCAD 2021] H.Peng, ..., T.Geng, A.Li, J.Bi, M.Song, W.Jiang, H.Liu, C.Ding: "Optimizing FPGA-based Accelerator Design for Large-Scale Molecular Similarity Search", 2021 International Conference On Computer Aided Design.

  • [ICCD 2021] C.Tan, T.Geng, C.Xie, N.Agostini, J.Li, A.Li, K.Barker, A.Tumeo: "DynPaC: Coarse-Grained, Dynamic, and Partially Reconfigurable Array for Streaming Applications", the 39th IEEE International Conference on Computer Design. (Best Paper Award)

   2020:

   2019:

  • [ICS 2019] T.Geng, T.Wang, C.Wu, C.Yang, W.Wu, A.Li, M.Herbordt: "O3BNN: An Out-Of-Order Architecture for High-Performance Binarized Neural Network Inference with Fine-Grained Pruning", the 33th ACM International Conference on Supercomputing.

  • [SC 2019] A.Li, T.Geng, T.Wang, M.Herbordt, S.Song, K.Barker: "BSTC: A Novel BinarizedSoft-Tensor-Core Design for Accelerating Bit-Based Approximated Neural Nets", Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis.

  • [SC 2019] C.Yang, T.Geng, T.Wang, ..., M.Herbordt: "Fully integrated FPGA molecular dynamics simulations", Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis.

 

Projects

Graph-Neural-Networks.webp
ai-circuit-board-technology-system-scaled_edited.jpg

Graph

Intelligence

 View Selected Papers for More Details:

I-GCN [MICRO 2021],

GCoD [HPCA 2022],

GCoS [ICCAD 2021],

AWB-GCN [MICRO 2020]

DNN Inference & Training

 View Selected Papers for More Details:

FPDeep [TC 2020]

O3BNN-R[TPDS 2021],

Tensor-Core ANN[SC2021],

BSTC[SC 2019],

O3BNN[ICS 2019]

CGRA

Architecture

 View Selected Papers for More Details:

DRIPS[HPCA2021], DynPaC[ICCD2021],

ASAP[ICS2022],

CQNN[HPEC 2020]

preço-de-gpu-pakistao_edited.png
ai-circuit-board-technology-system-scaled_edited.jpg

GPU

Architecture

 View Selected Papers for More Details:

Arbitrary-Precision Tensor Core [SC2021], 

Binarized Soft Tensor Core -- BSTC [SC 2019]

dominoes-2364492_1920_edited.png

NLP: RNN & Transformer

 View Selected Papers for More Details:

Co-design Transformer [DAC 2022], 

CSB-RNN [ICS 2020],

FPGA-based Transformer[ISQED 2021]

Drug Discovery --

Molecular Dynamics

 View Selected Papers for More Details:

FPGA-based MD

[ICCAD 2021], [SC 2019], [ASAP 2019],[FCCM2021], [HPEC2021], [FCCM2022]

Future Heterogeneity in Hardware and Systems

 View Selected Papers for More Details:

ARENA[TPDS2021],

In-NIC Data Compression[ICS2022],

FCsN[ FCCM 2022],

Smart Switch

[CPE 2021], [HPEC 2021], [FCCM 2020], [FPT 2020]

 

Meet The Team

Image from iOS.jpg

Zhenyu Pan

Research Interests:

1. Ising Graph Learning;

2. Quantum-inspired GNN;

3. Domain-specific Acceleration with FPGAs;

Zhuo Liu

Research Interests:

1. GNN Acceleration;

2. Heterogeneous System;

3. Ising Machine;

4. FPGAs;

WeChat Image_20220629182830_edited_edited.jpg

Yinghao Wu

Research Interests:

1. Spatial-Temporal GNN

2. Ising-aided GNN

3. ML Acceleration

4. Computer Architecture

WeChat Image_20220629143309_edited.jpg

Banksy Luo

Research Interests:

1. GPUs and Reconfigurability in GPUs;

2. Graph Learning on Heterogeneous System;

lz1_edited.jpg
 

Professional Services

Conference TPC/ERC:

International Symposium on High-Performance Computer Architecture [HPCA 2022]

International Conference on Field Programmable Logic and Applications [FPL 2022 & 2021]

International Parallel & Distributed Processing Symposium [IPDPS 2021]

International Conference on Application-specific Systems, Architectures and Processors [ASAP 2021]

Journal Review:

IEEE Transactions on Computers [TC]

IEEE Transactions on Parallel and Distributed Systems [TPDS]

ACM Transactions on Reconfigurable Technology and Systems [TRETS]

IEEE Transactions on Computer-Aided Design of Integrated Circuits & Systems [TCAD]

IEEE Computer Architecture Letters [CAL]

ACM Computing Surveys [CSUR]

Parallel Computing [ParCo]

Microprocessors and Microsystems [MICPRO]

 

News

09/2022 Our proposal received 2022 Meta (Facebook) Research Award on AI System Hardware/Software Codesign. Many thanks to Meta and Meta Research!

09/2022 Four papers were accepted by ICCD 2022, all about how to efficiently process GNNs with emerging heterogeneities including multicore, quantum computer, and ReRAM.

09/2022 Our proposal was selected as an internationally excellent finalist in Meta (Facebook) RFP - Networking for AI.

08/2022 Tony gave a talk at Zhejiang University.

06/2022 Three papers were accepted by FPL 2022.

04/2022 Two papers were accepted by ICS 2022.

03/2022 Tony gave an invited talk at George Mason University.

03/2022 Tony gave an invited talk at Umass Boston.

02/2022 One paper was accepted by DAC 2022.

02/2022 Tony gave an invited talk at the University of Connecticut.

02/2022 Tony gave an invited talk at Northwestern University.

02/2022 Tony gave an invited talk at William & Mary.

02/2022 Tony gave an invited talk at the University of Rochester.

02/2022 Tony gave an invited talk at Fordham University.

01/2022 Tony gave an invited talk at Illinois Institute of Technology.

12/2021 Tony gave an invited talk at Rensselaer Polytechnic Institute.

12/2021 Two papers were accepted by HPCA 2022.