Theme 1: Heterogeneous Computing Platforms
We envision a planet-scale distributed computing infrastructure with a myriad of heterogeneous accelerators. Accelerators will rapidly evolve with applications and, in addition, at any point in time, co-exist with earlier or later generations. Hence, we propose a new methodology to easily generate, deploy, and reconfigure Evolvable accelerators. Groups of accelerators will be organized into Ensembles distributed across one or multiple datacenters. Applications will dynamically pick (and reconfigure) the desired set of accelerators from the ensemble with minor overhead. Advanced runtime and compilation methods will reconfigure multi tenant accelerator ensembles, and map and schedule applications to them. Finally, revamped general-purpose cores will differentiate to increase performance and energy efficiency.
Papers and Presentations:
2023
Supporting a Virtual Vector Instruction Set on a Commercial Compute-in-SRAM Accelerator
Courtney Golden, Dan Ilan, Caroline Huang, Niansong Zhang, Zhiru Zhang, and Christopher Batten
IEEE COMPUTER ARCHITECTURE LETTERS, VOL. 23, NO. 1, JANUARY-JUNE 2024
10.1109/LCA.2023.3341389
PrimeNet: Pre-Training for Irregular Multivariate Time Series
Ranak Roy Chowdhury, Jiacheng Li, Xiyuan Zhang, Dezhi Hong, Rajesh K. Gupta, Jingbo Shang
Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI ’23) Feb 7, 2023
10.1609/aaai.v37i6.25876
Proteus: HLS-based NoC Generator and Simulator
Abhimanyu Rajeshkumar BambhaniyaYangyu ChenAnshumanRohan BanerjeeTushar Krishna
Design, Automation and Test in Europe Conference April 2023
10.23919/DATE56975.2023.10137173
SPADE: A Flexible and Scalable Accelerator for SpMM and SDDMM
Gerasimos Gerogiannis, Serif Yesil, Damitha Lenadora, Dingyuan Cao, Charith Mendis, Josep Torrellas
ISCA '23: Proceedings of the 50th Annual International Symposium on Computer Architecture June 2023
10.1145/3579371.3589054
SENSEi: Input-sensitive dense-sparse primitive compositions for GNN acceleration
Damitha Lenadora, Vimarsh Sathia, Gerasimos Gerogiannis, Serif Yesil, Josep Torrellas, Charith Mendis
arxiv.org/abs/2306.15155 June 2023
FluRKA: Fast fused Low-Rank & Kernel Attention
Ahan Gupta, Yueming Yuan, Yanqi Zhou, Charith Mendis
10.48550/arXiv.2306.15799 June 2023
MXFaaS: Resource Sharing in Serverless Environments for Parallelism and Efficiency
Jovan Stojkovic, Tianyin Xu, Hubertus Franke, Josep Torrellas
ISCA '23: Proceedings of the 50th Annual International Symposium on Computer Architecture June 2023
10.1145/3579371.3589069
µManycore: A Cloud-Native CPU for Tail at Scale
Jovan Stojkovic, Chunao Liu, Muhammad Shahbaz, Josep Torrellas
ISCA '23: Proceedings of the 50th Annual International Symposium on Computer Architecture June 2023
10.1145/3579371.3589068
Arvon: A Heterogeneous System-in-Package Integrating FPGA and DSP Chiplets for Versatile Workload Acceleration
Cheng-Hsun Lu , Junkang Zhu, Tianyu Wei , Wei Tang , Zhengya Zhang
2023 Symposium on VLSI Circuits June 2023
10.1109/JSSC.2023.3343457
Towards Diverse and Coherent Augmentation for Time-Series Forecasting
Xiyuan Zhang, Ranak Roy Chowdhury, Jingbo Shang, Rajesh Gupta, Dezhi Hong
ICASSP 2023 June 2023
10.48550/arXiv.2303.14254
Unleashing the Power of Shared Label Structures for Human Activity Recognition
Xiyuan Zhang, Ranak Roy Chowdhury, Jiayun Zhang, Rajesh K. Gupta, Jingbo Shang, Dezhi Hong
CIKM 2023 October 2023
10.48550/arXiv.2301.03462
Micro-Armed Bandit: Lightweight & Reusable Reinforcement Learning for Microarchitecture Decision-Making
Gerasimos Gerogiannis, Josep Torrellas
Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture (Micro ’23)
10.1145/3613424.3623780 Oct 2023
Machine Learning Hardware Design for Efficiency, Flexibility and Scalability
Jie-Fang Zhang, Zhengya Zhang
IEEE Circuits and Systems Magazine ( IF 6.9 ) Pub Date: October 2023
10.1109/mcas.2023.3302390
Large Graph Property Prediction via Graph Segment Training
Kaidi Cao, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Dustin Zelle, Yanqi Zhou, Charith Mendis, Jure Leskovec, Bryan Perozzi
arXiv:2305.12322 Nov 2023
TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs
Charith Mendis, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Bahare Fatem, Bryan Perozzi, & Kaidi Cao
Workshop on Graph Learning Benchmarks Dec 2023
10.48550/arXiv.2308.13490
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training
Hongzheng Chen, Codi Hao Yu, Shuai Zheng, Zhen Zhang, Zhiru Zhang, Yida Wang
arXiv:2302.08005 December 2023
An Intermediate Language for General Sparse Format Customization
Jie Liu, Zhongyuan Zhao, Zijian Ding, Benjamin Brock, Hongbo Rong, Zhiru Zhang
IEEE Computer Architecture Letters (Volume: 22, Issue: 2, July-Dec. 2023)
10.1109/LCA.2023.3262610