Sunday, October 12 (Tutorials and Workshops) • Monday, October 13 • Tuesday, October 14 • Poster Session (4:10 PM Monday)

Sunday, October 12

   
9:00–10:20
Tutorial 1: Observability into Application-level Metrics with eBPFs
Session 1 of 2
10:20–10:40
Coffee Break
10:40–12:00
Tutorial 1: Observability into Application-level Metrics with eBPFs
Session 2 of 2
12:00–1:20
Lunch
1:20–3:00
Workshop 1: Workshop on Security–Performance Trade-offs
Session 1 of 2 • Website pending
3:00–3:20
Coffee Break
3:20–5:00
Workshop 1: Workshop on Security–Performance Trade-offs
Session 2 of 2 • Website pending

Monday, October 13

   
9:00–9:15
Welcome
9:15–10:15
Keynote 1
TBD
10:15–10:35
Coffee Break
10:35–11:50
Session 1: Cross-Domain Methods for Workload Analysis
Session chair: TBD
  Belenos: Bottleneck Evaluation to Link Biomechanics to Novel Computing Optimizations
Hana Chitsaz, Johnson Umeike, Amirmahdi Namjoo (University of Maryland, College Park); Babak N. Safa (University of South Florida); Bahar Asgari (University of Maryland, College Park)
  Workload Characterization Using Cross-Layer Features and Multilevel PCA
Lina Sawalha, Grant Deljevic (Western Michigan University)
  Athena: A Plug-and-Play Advisor for Retrieval-Augmented Generation using VectorDB
Ning Liang (Duke University); Fabian Wenz, Jana Giceva (TU Munich); Lisa Wu Wills (Duke University)
  The Fake-Busy and True-Idle Problems of Running Graph Applications on Chiplet-Based Multi-cores
Rashid Aligholipour, Yuan Yao (Uppsala University)
  WANify: Gauging and Balancing Runtime WAN Bandwidth for Geo-distributed Data Analytics
Anshuman Das Mohapatra, Kwangsung Oh (University of Nebraska at Omaha)
12:00–1:20
Lunch
1:20–2:35
Session 2: Large Language Models
Session chair: TBD
  Understanding Distributed Training of Large Language Models with Unified Virtual Memory
Jane Rhee, Eunbi Jeong (Ewha Womans University); Jiwon Lee (Samsung Electronics); Myung Kuk Yoon (Ewha Womans University)
  Confidential LLM Inference: Performance and Cost Across CPU and GPU TEEs
Marcin Chrapek, Marcin Copik, Etienne Mettaz, Torsten Hoefler (ETH Zurich)
  EdgeReasoning: Optimizing Reasoning LLM Deployment on Edge GPUs
Benjamin Kubwimana, Qijing Jenny Huang (NVIDIA)
  Keeping up with Large Language Models: A Holistic Methodology of Compute, Memory, Communication, and Cost Modeling
Wenzhe Guo, Joyjit Kundu, Uras Tos, Weijiang Kong, Giuliano Sisto, Timon Evenblij, Manu Perumkunnil (imec)
  DABench-LLM: Standardized and In-Depth Benchmarking of Post-Moore Dataflow AI Accelerators for LLMs
Ziyu Hu, Zhiqing Zhong (Stevens Institute of Technology); Weijian Zheng (Argonne National Laboratory); Zhijing Ye (Stevens Institute of Technology); Xuwei Tan, Xueru Zhang (The Ohio State University); Zhen Xie (Binghamton University); Rajkumar Kettimuthu (Argonne National Laboratory); Xiaodong Yu (Stevens Institute of Technology)
2:35–2:55
Coffee Break
2:55–4:10
Session 3: Security, Confidentiality, and Reliability
Session chair: TBD
  ZKProphet: Understanding Performance of Zero-Knowledge Proofs on GPUs
Tarunesh Verma, Yichao Yuan, Nishil Talati (University of Michigan); Todd Austin (University of Michigan / Agita Labs)
  The Curious Case of Global Stable Loads
Shagnik Pal (University of Texas at Austin); Jeeho Ryoo (Fairleigh Dickinson University); Lizy K. John (University of Texas at Austin)
  vACE: Exploring the Design Space of Vector Processing Units for Soft Error Vulnerability
George-Marios Fragkoulis, Dimitris Gizopoulos (University of Athens)
  CASM: A Generalizable and Accessible Security Metric to Evaluate Security of Cache Architectures
Phaedra Curlin, Tamara Silbergleit Lehman (University of Colorado Boulder)
  Learning Architectural Cache Simulator Behaviour
Pranjali Jain (UC Santa Barbara); Meiru Han (University of Pennsylvania); Zhizhou Zhang (Uber Technologies Inc); Brandon Lee, Jonathan Balkind (UC Santa Barbara)
4:10–5:30
Poster Session
Session chair: TBD
5:30–8:30
Conference Banquet

Tuesday, October 14

   
8:45–10:00
Session 4: AI Accelerators, PIM, and Post-Moore Architectures
Session chair: TBD
  HALO: Hybrid Systolic Arrays via Logical Partitioning for Acceleration of Complex-Valued Neural Networks
Ji Yeong Yi, Eunbi Jeong, SungHee Yum, Jane Rhee (Ewha Womans University); Sangun Choi, Gunjae Koo, Yunho Oh (Korea University); Myung Kuk Yoon (Ewha Womans University)
  Exploring Lossy Compression of Activation Data for Emerging AI Accelerators: A Case Study on the Graphcore IPU
Milan Shah (North Carolina State University); Xiaodong Yu (Stevens Institute of Technology); Sheng Di (Argonne National Laboratory); Michela Becchi (North Carolina State University); Franck Cappello (Argonne National Laboratory)
  BetterTogether: An Interference-Aware Framework for Fine-grained Software Pipelining on Heterogeneous SoCs
Yanwen Xu, Rithik Sharma, Zheyuan Chen, Shaan Mistry (University of California, Santa Cruz); Tyler Sorensen (Microsoft Research and University of California, Santa Cruz)
  PRISM: Processing-In-Memory Sparse MTTKRP for Tensor Decomposition Acceleration
Daniel Pacheco, Leonel Sousa (INESC-ID, Instituto Superior Técnico, Universidade de Lisboa); Aleksandar Ilic (INESC-ID & Instituto Superior Técnico)
  ALPHA-PIM: Analysis of Linear Algebraic Processing for High-Performance Graph Applications on a Real Processing-In-Memory System
Marzieh Barkhordar, Alireza Tabatabaeian (Simon Fraser University); Mohammad Sadrosadati (ETH Zurich); Christina Giannoula (University of Toronto); Juan Gomez Luna (NVIDIA); Izzat El Hajj (American University of Beirut); Onur Mutlu (ETH Zurich); Alaa Alameldeen (Simon Fraser University)
10:00–10:20
Coffee Break
10:20–11:50
Session 5: Emerging Workloads
Session chair: TBD
  PangenomicsBench: A Benchmark Suite and Characterization of Pangenomics
Noah Kaplan (University of Michigan); Jan-Niklas Schmelzle (Cornell University); Yufeng Gu (University of Michigan); Erik Garrison (University of Tennessee Health Science Center); Christopher Batten (Cornell University); Reetuparna Das (University of Michigan)
  decoder-bench: Benchmarking Decoders for Quantum Error Correction
Satvik Maurya (University of Wisconsin-Madison); Joshua Viszlai (University of Chicago); Nithin Raveendran (University of Arizona); Poulami Das (UT Austin); Swamit Tannu (University of Wisconsin-Madison)
  A Comprehensive Analysis of Graph Neural Networks Training at Different Scales
Mostafa Eghbali Zarch, Michela Becchi (North Carolina State University)
  EntoBench: A Benchmark Suite and Evaluation Framework for Insect-Scale Robotics
Derin Ozturk, Nick Cebry, Angela Cui, Hang Gao, Julie Villamil, Farrell Helbling, Christopher Batten (Cornell University)
  Improving the Performance of Out-of-Core LLM Inference Using Heterogeneous Host Memory
Sudhanshu Gupta (University of Rochester); Sandhya Dwarkadas (University of Virginia)
  miniGiraffe: A Pangenomic Mapping Proxy App
Jessica Imlau Dagostini, Scott Beamer (University of California, Santa Cruz); Tyler Sorensen (Microsoft Research and University of California, Santa Cruz); Joseph Manzano (Pacific Northwest National Lab)
12:00–1:20
Lunch
1:20–2:50
Session 6: Memory, Storage, and Beyond
Session chair: TBD
  Does Linux Provide Performance Isolation for NVMe SSDs? Configuring cgroups for I/O Control in the NVMe Era
Krijn Doekemeijer, Zebin Ren, Tiziano De Matteis, Balakrishnan Chandrasekaran (Vrije Universiteit Amsterdam); Animesh Trivedi (IBM Research Europe, Zurich)
  Dissecting CPU–GPU Unified Physical Memory for HPC Applications on the AMD MI300A APU
Jacob Wahlgren, Gabin Schieffer, Ruimin Shi (KTH Royal Institute of Technology); Edgar Leon, Roger Pearce, Maya Gokhale (Lawrence Livermore National Laboratory); Ivy Peng (KTH Royal Institute of Technology)
  Design and Accuracy Trade-offs in Computational Statistics
Tiancheng Xu, Alan L. Cox, Scott Rixner (Rice University)
  An Analysis of Ethereum Workloads from a Key-Value Storage Perspective
Yanjing Ren, Jia Zhao (The Chinese University of Hong Kong); Jingwei Li (University of Electronic Science and Technology of China); Patrick P. C. Lee (The Chinese University of Hong Kong)
  Storage-Based Approximate Nearest Neighbor Search: What are the Performance, Cost, and I/O Characteristics?
Zebin Ren, Krijn Doekemeijer (Vrije Universiteit Amsterdam); Padma Apparao (Intel Corporation); Animesh Trivedi (IBM Research Europe, Zurich)
  Sweet or Sour CHERI: Performance Characterization of the Arm Morello Platform
Xiaoyang Sun (University of Leeds); Jeremy Singer (University of Glasgow); Zheng Wang (University of Leeds)
2:50–3:10
Coffee Break
3:10–4:40
Session 7: Heterogeneous and Domain-Specific Systems
Session chair: TBD
  Characterizing Adaptive Mesh Refinement on Heterogeneous Platforms with Parthenon-VIBE
Akash Poptani, Alireza Khadem, Scott Mahlke (University of Michigan); Jonah Miller, Joshua Dolence, Galen Shipman (Los Alamos National Laboratory); Reetuparna Das (University of Michigan)
  XRSight: An End-to-End Hardware–Software Co-Design Platform for XR SoC Evaluation
Prashanth Ganesh, Zekai Lin, Yakun Sophia Shao (UC Berkeley)
  Icicle: Open-source Hardware Support for Top-Down Microarchitectural Analysis on RISC-V
Matthew Edwin Weingarten, Michael Grieco, Stephen A. Edwards, Tanvir Ahmed Khan (Columbia University)
  AlphaFold3 Workload Characterization: A Comprehensive Analysis of Bottlenecks and Performance Scaling
Jinpyo Kim, Mingi Kwon, Jishen Zhao (UCSD)
  Characterizing and Optimizing Real-Time Optimal Control for Embedded SoCs
Shengjun Kris Dong, Dima Nikiforov, Widyadewi Soedarmadji, Minh Nguyen, Vikram Jain, Christopher W. Fletcher, Yakun Sophia Shao (University of California, Berkeley)
  ClusterSim: Modeling Thread Block Clusters in Hopper GPUs
Tim Lühnen (Technische Universität Hamburg); Jyotirman Behera, Devashree Tripathy (IIT Bhubaneswar); Sohan Lal (Technische Universität Hamburg)
4:40–5:00
Closing and Best Paper Award

Poster Session

Session chair: TBD

Poster session details TBD. The poster session will be held on Monday, October 13, from 4:10 PM to 5:30 PM.