Theme 1: Heterogeneous Computing Platforms

We envision a planet-scale distributed computing infrastructure with a myriad of heterogeneous accelerators. Accelerators will evolve rapidly with applications and, at any point in time, co-exist with earlier or later generations. Hence, we propose a new methodology to easily generate, deploy, and reconfigure Evolvable accelerators. Groups of accelerators will be organized into Ensembles distributed across one or multiple datacenters. Applications will dynamically pick (and reconfigure) the desired set of accelerators from the ensemble with minor overhead. Advanced runtime and compilation methods will reconfigure multi-tenant accelerator ensembles, and map and schedule applications onto them. Finally, revamped general-purpose cores will differentiate to increase performance and energy efficiency.

Design-space exploration of reconfigurable ASIC accelerators (Courtesy of Zhiru Zhang).

Papers and Presentations:

2023

Supporting a Virtual Vector Instruction Set on a Commercial Compute-in-SRAM Accelerator

Courtney Golden, Dan Ilan, Caroline Huang, Niansong Zhang, Zhiru Zhang, and Christopher Batten

IEEE Computer Architecture Letters, Vol. 23, No. 1, January-June 2024

10.1109/LCA.2023.3341389

 

PrimeNet: Pre-Training for Irregular Multivariate Time Series

Ranak Roy Chowdhury, Jiacheng Li, Xiyuan Zhang, Dezhi Hong, Rajesh K. Gupta, Jingbo Shang

Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI ’23) Feb 7, 2023

10.1609/aaai.v37i6.25876

 

Proteus: HLS-based NoC Generator and Simulator

Abhimanyu Rajeshkumar Bambhaniya, Yangyu Chen, Anshuman, Rohan Banerjee, Tushar Krishna

Design, Automation and Test in Europe Conference April 2023

10.23919/DATE56975.2023.10137173

 

SPADE: A Flexible and Scalable Accelerator for SpMM and SDDMM

Gerasimos Gerogiannis, Serif Yesil, Damitha Lenadora, Dingyuan Cao, Charith Mendis, Josep Torrellas

ISCA '23: Proceedings of the 50th Annual International Symposium on Computer Architecture

10.1145/3579371.3589054

 

SENSEi: Input-sensitive dense-sparse primitive compositions for GNN acceleration

Damitha Lenadora, Vimarsh Sathia, Gerasimos Gerogiannis, Serif Yesil, Josep Torrellas, Charith Mendis

arxiv.org/abs/2306.15155 June 2023

 

FluRKA: Fast fused Low-Rank & Kernel Attention

Ahan Gupta, Yueming Yuan, Yanqi Zhou, Charith Mendis

10.48550/arXiv.2306.15799 June 2023

 

MXFaaS: Resource Sharing in Serverless Environments for Parallelism and Efficiency

Jovan Stojkovic, Tianyin Xu, Hubertus Franke, Josep Torrellas

ISCA '23: Proceedings of the 50th Annual International Symposium on Computer Architecture June 2023

10.1145/3579371.3589069

 

µManycore: A Cloud-Native CPU for Tail at Scale

Jovan Stojkovic, Chunao Liu, Muhammad Shahbaz, Josep Torrellas

ISCA '23: Proceedings of the 50th Annual International Symposium on Computer Architecture June 2023

10.1145/3579371.3589068

 

Arvon: A Heterogeneous System-in-Package Integrating FPGA and DSP Chiplets for Versatile Workload Acceleration

Cheng-Hsun Lu, Junkang Zhu, Tianyu Wei, Wei Tang, Zhengya Zhang

2023 Symposium on VLSI Circuits June 2023

10.1109/JSSC.2023.3343457 

 

Towards Diverse and Coherent Augmentation for Time-Series Forecasting

Xiyuan Zhang, Ranak Roy Chowdhury, Jingbo Shang, Rajesh Gupta, Dezhi Hong

ICASSP 2023 June 2023

10.48550/arXiv.2303.14254

 

Unleashing the Power of Shared Label Structures for Human Activity Recognition

Xiyuan Zhang, Ranak Roy Chowdhury, Jiayun Zhang, Rajesh K. Gupta, Jingbo Shang, Dezhi Hong

CIKM 2023 October 2023

10.48550/arXiv.2301.03462

 

Micro-Armed Bandit: Lightweight & Reusable Reinforcement Learning for Microarchitecture Decision-Making

Gerasimos Gerogiannis, Josep Torrellas

Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO ’23) Oct 2023

10.1145/3613424.3623780

 

Machine Learning Hardware Design for Efficiency, Flexibility and Scalability

Jie-Fang Zhang, Zhengya Zhang

IEEE Circuits and Systems Magazine (IF 6.9) October 2023

10.1109/mcas.2023.3302390

 

Large Graph Property Prediction via Graph Segment Training

Kaidi Cao, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Dustin Zelle, Yanqi Zhou, Charith Mendis, Jure Leskovec, Bryan Perozzi

arXiv:2305.12322 Nov 2023

 

TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs

Charith Mendis, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Bahare Fatemi, Bryan Perozzi, Kaidi Cao

Workshop on Graph Learning Benchmarks Dec 2023

10.48550/arXiv.2308.13490

 

Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training

Hongzheng Chen, Cody Hao Yu, Shuai Zheng, Zhen Zhang, Zhiru Zhang, Yida Wang

arXiv:2302.08005 December 2023

 

An Intermediate Language for General Sparse Format Customization

Jie Liu, Zhongyuan Zhao, Zijian Ding, Benjamin Brock, Hongbo Rong, Zhiru Zhang

IEEE Computer Architecture Letters (Volume: 22, Issue: 2, July-Dec. 2023)

10.1109/LCA.2023.3262610