About Tony
MY BACKGROUND
Dr. Tony Geng is a tenure-track assistant professor in the ECE and CS departments of the University of Rochester (UR) and the director of UR's IntelliArch Lab. Before joining Rochester, Tony worked for two years in the Physical & Computational Sciences Directorate (PCSD) at Pacific Northwest National Laboratory (PNNL), a U.S. Department of Energy national laboratory. He received his Ph.D. in Computer Engineering from Boston University in 2020. His research interests are at the intersection of computer architecture & systems, machine learning, graph intelligence, and high-performance computing. Tony's papers have appeared in many prestigious conferences and journals, including MICRO, HPCA, AAAI, DAC, SC, TPDS, and TC.
To prospective students:
I am always looking for postdocs, Ph.D. students, and interns (remote is acceptable) to work on next-generation hardware architectures & systems for AI, graph intelligence, ML, and their applications, including fintech, recommendation systems, social media, and smart cities. Please send me an email with your CV and transcripts if you are interested.
RESEARCH INTERESTS
Computer Architecture: GPU, FPGA, CGRA, Accelerators for AI, Quantum Computing, Future Heterogeneity in Hardware and Systems
Machine Learning: Spatio-temporal Graph Neural Networks, Broadly-defined Graph Intelligence, DNNs
Applications: Fintech, Social Media, Recommendation System, Smart City, Public Health, Supply Chain
Selected Publications
2022:
- [AAAI 2023] Z.Pan, A.Sharma, J.Hu, Z.Liu, A.Li, H.Liu, M.Huang, T.Geng: "Ising-Traffic: An Ising-based Framework for Traffic Congestion Prediction with Uncertainty", Thirty-Seventh AAAI Conference on Artificial Intelligence.
- [TPDS 2022] W.Sun, A.Li, T.Geng, S.Stuijk, H.Corporaal: "Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numerical Behaviors", IEEE Transactions on Parallel and Distributed Systems.
- [HPCA 2022] H.You*, T.Geng*, Y.Zhang, A.Li, Y.Lin: "GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design", The 28th IEEE International Symposium on High-Performance Computer Architecture.
- [HPCA 2022] C.Tan, N.B.Agostini, T.Geng, C.Xie, J.Li, A.Li, K.Barker, A.Tumeo: "DRIPS: Dynamic Rebalancing of Pipelined Streaming Applications on CGRAs", The 28th IEEE International Symposium on High-Performance Computer Architecture.
- [DAC 2022] H. Peng, ..., T.Geng, ..., C.Ding: "A Length Adaptive Algorithm-Hardware Co-design of Transformer on FPGA Through Sparse Attention and Dynamic Pipelining", The 58th Design Automation Conference.
- [ICS 2022] C.Zhang, S.Jin, T.Geng, J.Tian, A.Li, D.Tao: "Accelerating Parallel I/O Via Hardware-Algorithm Co-Designed Adaptive Lossy Compression", the 36th ACM International Conference on Supercomputing.
- [ICS 2022] C.Tan, T.Tembe, J.Zhang, B.Fang, T.Geng, G.Wei, D.Brooks, A.Tumeo, G.Gopalakrishnan, A.Li: "ASAP - Automatic Synthesis of Area-Efficient and Precision-Aware CGRA", the 36th ACM International Conference on Supercomputing.
2021:
- [MICRO 2021] T.Geng, C.Wu, ..., M.Herbordt, Y.Lin, A.Li: "I-GCN: A Graph Convolutional Network Accelerator with Runtime Locality Enhancement through Islandization", the 54th IEEE/ACM International Symposium on Microarchitecture.
- [TPDS 2021] T.Geng, T.Wang, C.Wu, Y.Li, ..., A.Li, M.Herbordt: "O3BNN-R: An Out-Of-Order Architecture for High-Performance and Regularized BNN Inference", IEEE Transactions on Parallel and Distributed Systems.
- [TPDS 2021] C.Tan, C.Xie, T.Geng, ..., K.Barker, A.Li: "ARENA: Asynchronous Reconfigurable Accelerator Ring to Enable Data-Centric Parallel Computing", IEEE Transactions on Parallel and Distributed Systems.
- [SC 2021] B.Feng, Y.Wang, T.Geng, A.Li, Y.Ding: "APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores", Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis.
- [ICCAD 2021] Y.Zhang, H.You, Y.Fu, T.Geng, A.Li, Y.Lin: "G-CoS: GNN-Accelerator Co-Search Towards Both Better Accuracy and Efficiency", 2021 International Conference On Computer Aided Design.
- [ICCD 2021] C.Tan, T.Geng, C.Xie, N.Agostini, J.Li, A.Li, K.Barker, A.Tumeo: "DynPaC: Coarse-Grained, Dynamic, and Partially Reconfigurable Array for Streaming Applications", the 39th IEEE International Conference on Computer Design. (Best Paper Award)
2020:
- [TC 2020] T.Geng*, T.Wang*, A.Li, X.Jin, M.Herbordt: "FPDeep: Scalable Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters", IEEE Transactions on Computers.
- [ICS 2020] T.Geng*, R.Shi*, P.Dong*, ..., M.Herbordt, A.Li, Y.Wang: "CSB-RNN: A Faster-than-Realtime RNN Acceleration Framework with Compressed Structured Blocks", the 34th ACM International Conference on Supercomputing.
2019:
- [ICS 2019] T.Geng, T.Wang, C.Wu, C.Yang, W.Wu, A.Li, M.Herbordt: "O3BNN: An Out-Of-Order Architecture for High-Performance Binarized Neural Network Inference with Fine-Grained Pruning", the 33rd ACM International Conference on Supercomputing.
- [SC 2019] A.Li, T.Geng, T.Wang, M.Herbordt, S.Song, K.Barker: "BSTC: A Novel Binarized-Soft-Tensor-Core Design for Accelerating Bit-Based Approximated Neural Nets", Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis.
- [SC 2019] C.Yang, T.Geng, T.Wang, ..., M.Herbordt: "Fully Integrated FPGA Molecular Dynamics Simulations", Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis.
Projects
Graph Intelligence
View Selected Papers for More Details:
I-GCN [MICRO 2021],
GCoD [HPCA 2022],
G-CoS [ICCAD 2021],
AWB-GCN [MICRO 2020],
Ising-Traffic [AAAI 2023]
DNN Inference & Training
View Selected Papers for More Details:
FPDeep [TC 2020],
O3BNN-R [TPDS 2021],
Tensor-Core ANN [SC 2021],
BSTC [SC 2019],
O3BNN [ICS 2019]
CGRA Architecture
View Selected Papers for More Details:
DRIPS [HPCA 2022],
DynPaC [ICCD 2021],
ASAP [ICS 2022],
CQNN [HPEC 2020]
GPU Architecture
View Selected Papers for More Details:
AP-TC [SC 2021],
BSTC [SC 2019],
Sparse TC [TPDS 2022]
NLP: RNN & Transformer
View Selected Papers for More Details:
Co-design Transformer [DAC 2022],
CSB-RNN [ICS 2020],
FPGA Transformer [ISQED 2021]
Drug Discovery: Molecular Dynamics
View Selected Papers for More Details:
FPGA-based MD [ICCAD 2021], [SC 2019], [ASAP 2019], [FCCM 2021], [HPEC 2021], [FCCM 2022]
Future Heterogeneity in Hardware and Systems
View Selected Papers for More Details:
ARENA [TPDS 2021],
In-NIC Data Compression [ICS 2022],
FCsN [FCCM 2022],
Smart Switch [CPE 2021], [HPEC 2021], [FCCM 2020], [FPT 2020]
Meet The Team

Zhenyu Pan
Research Interests:
1. Ising Graph Learning
2. Graph Neural Network
3. Quantum-Aided GNN

Zhuo Liu
Research Interests:
1. Ising Graph Learning
2. Temporal Graph Learning
3. Graphical Model

Banksy Luo
Research Interests:
1. EDA
2. Computer Vision
3. Deep Learning

Clein Song

Chuan Liu
Research Interests:
1. Graph Neural Networks
2. Future Graph Learning
Research Interests:
1. VLSI
2. Mixed-Signal IC
3. Future Learning System
News
11/2022 One paper was accepted by AAAI 2023. Congrats to the lead author, Zhenyu Pan, for publishing in his first Ph.D. semester.
11/2022 One paper was accepted by LoG 2023, the Learning on Graphs (LoG) Conference, a very good new conference that we strongly recommend.
10/2022 One paper was accepted by TPDS 2022.
09/2022 Prof. Tony Geng received a Meta (Facebook) Faculty Research Award on AI System Hardware/Software Codesign.
09/2022 Four papers were accepted by ICCD 2022.
09/2022 Our proposal was selected as an internationally excellent finalist in Meta (Facebook) RFP - Networking for AI.
06/2022 Three papers were accepted by FPL 2022.
04/2022 Two papers were accepted by ICS 2022.
02/2022 One paper was accepted by DAC 2022.
02/2022 Tony gave an invited talk at Northwestern University.
02/2022 Tony gave an invited talk at the University of Rochester.