Accepted Papers

We are pleased to announce the accepted papers for IISWC 2025.

Accepted Papers (Sorted by Paper ID)

ZKProphet: Understanding Performance of Zero-Knowledge Proofs on GPUs
Tarunesh Verma, Yichao Yuan, Nishil Talati (University of Michigan); Todd Austin (University of Michigan / Agita Labs)

Characterizing Adaptive Mesh Refinement on Heterogeneous Platforms with Parthenon-VIBE
Akash Poptani, Alireza Khadem, Scott Mahlke (University of Michigan); Jonah Miller, Joshua Dolence, Galen Shipman (Los Alamos National Laboratory); Reetuparna Das (University of Michigan)

The Curious Case of Global Stable Loads
Shagnik Pal (University of Texas at Austin); Jeeho Ryoo (Fairleigh Dickinson University); Lizy K. John (UT Austin)

XRSight: An End-to-End Hardware–Software Co-Design Platform for XR SoC Evaluation
Prashanth Ganesh, Zekai Lin, Yakun Sophia Shao (UC Berkeley)

vACE: Exploring the Design Space of Vector Processing Units for Soft Error Vulnerability
George-Marios Fragkoulis, Dimitris Gizopoulos (University of Athens)

PangenomicsBench: A Benchmark Suite and Characterization of Pangenomics
Noah Kaplan (University of Michigan); Jan-Niklas Schmelzle (Cornell University); Yufeng Gu (University of Michigan); Erik Garrison (University of Tennessee Health Science Center); Christopher Batten (Cornell University); Reetuparna Das (University of Michigan)

Understanding Distributed Training of Large Language Models with Unified Virtual Memory
Jane Rhee, Eunbi Jeong (Ewha Womans University); Jiwon Lee (Samsung Electronics); Myung Kuk Yoon (Ewha Womans University)

Does Linux Provide Performance Isolation for NVMe SSDs? Configuring cgroups for I/O Control in the NVMe Era
Krijn Doekemeijer, Zebin Ren, Tiziano De Matteis, Balakrishnan Chandrasekaran (Vrije Universiteit Amsterdam); Animesh Trivedi (IBM Research Europe, Zurich)

Belenos: Bottleneck Evaluation to Link Biomechanics to Novel Computing Optimizations
Hana Chitsaz, Johnson Umeike, Amirmahdi Namjoo (University of Maryland, College Park); Babak N. Safa (University of South Florida); Bahar Asgari (University of Maryland, College Park)

Icicle: Open-source Hardware Support for Top-Down Microarchitectural Analysis on RISC-V
Matthew Edwin Weingarten, Michael Grieco, Stephen A Edwards, Tanvir Ahmed Khan (Columbia University)

Dissecting CPU–GPU Unified Physical Memory for HPC Applications on the AMD MI300A APU
Jacob Wahlgren, Gabin Schieffer, Ruimin Shi (KTH Royal Institute of Technology); Edgar Leon, Roger Pearce, Maya Gokhale (Lawrence Livermore National Laboratory); Ivy Peng (KTH Royal Institute of Technology)

Design and Accuracy Trade-offs in Computational Statistics
Tiancheng Xu, Alan L. Cox, Scott Rixner (Rice University)

decoder-bench: Benchmarking Decoders for Quantum Error Correction
Satvik Maurya (University of Wisconsin-Madison); Joshua Viszlai (University of Chicago); Nithin Raveendran (University of Arizona); Poulami Das (UT Austin); Swamit Tannu (University of Wisconsin-Madison)

CASM: A Generalizable and Accessible Security Metric to Evaluate Security of Cache Architectures
Phaedra Curlin, Tamara Silbergleit Lehman (University of Colorado Boulder)

HALO: Hybrid Systolic Arrays via Logical Partitioning for Acceleration of Complex-Valued Neural Networks
Ji Yeong Yi, Eunbi Jeong, SungHee Yum, Jane Rhee (Ewha Womans University); Sangun Choi, Gunjae Koo, Yunho Oh (Korea University); Myung Kuk Yoon (Ewha Womans University)

Workload Characterization Using Cross-Layer Features and Multilevel PCA
Lina Sawalha, Grant Deljevic (Western Michigan University)

Learning Architectural Cache Simulator Behaviour
Pranjali Jain (UC Santa Barbara); Meiru Han (University of Pennsylvania); Zhizhou Zhang (Uber Technologies Inc); Brandon Lee, Jonathan Balkind (UC Santa Barbara)

Confidential LLM Inference: Performance and Cost Across CPU and GPU TEEs
Marcin Chrapek, Marcin Copik, Etienne Mettaz, Torsten Hoefler (ETH Zurich)

Improving the Performance of Out-of-Core LLM Inference Using Heterogeneous Host Memory
Sudhanshu Gupta (University of Rochester); Sandhya Dwarkadas (University of Virginia)

Sweet or Sour CHERI: Performance Characterization of the Arm Morello Platform
Xiaoyang Sun (University of Leeds); Jeremy Singer (University of Glasgow); Zheng Wang (University of Leeds)

AlphaFold3 Workload Characterization: A Comprehensive Analysis of Bottlenecks and Performance Scaling
Jinpyo Kim, Mingi Kwon, Jishen Zhao (UCSD)

Athena: A Plug-and-Play Advisor for Retrieval-Augmented Generation using VectorDB
Ning Liang (Duke University); Fabian Wenz, Jana Giceva (TU Munich); Lisa Wu Wills (Duke University)

Characterizing and Optimizing Real-Time Optimal Control for Embedded SoCs
Shengjun Kris Dong, Dima Nikiforov, Widyadewi Soedarmadji, Minh Nguyen, Vikram Jain, Christopher W. Fletcher, Yakun Sophia Shao (University of California, Berkeley)

An Analysis of Ethereum Workloads from a Key-Value Storage Perspective
Yanjing Ren, Jia Zhao (The Chinese University of Hong Kong); Jingwei Li (University of Electronic Science and Technology of China); Patrick P. C. Lee (The Chinese University of Hong Kong)

miniGiraffe: A Pangenomic Mapping Proxy App
Jessica Imlau Dagostini (University of California Santa Cruz); Scott Beamer (University of California, Santa Cruz); Tyler Sorensen (Microsoft Research and UC Santa Cruz); Joseph Manzano (Pacific Northwest National Lab)

EdgeReasoning: Optimizing Reasoning LLM Deployment on Edge GPUs
Benjamin Kubwimana, Qijing Jenny Huang (NVIDIA)

Exploring Lossy Compression of Activation Data for Emerging AI Accelerators: A Case Study on the Graphcore IPU
Milan Shah (North Carolina State University); Xiaodong Yu (Stevens Institute of Technology); Sheng Di (Argonne National Laboratory); Michela Becchi (North Carolina State University); Franck Cappello (Argonne National Laboratory)

Storage-Based Approximate Nearest Neighbor Search: What are the Performance Cost and I/O Characteristics?
Zebin Ren (Vrije Universiteit Amsterdam); Krijn Doekemeijer (Vrije Universiteit Amsterdam, The Netherlands); Padma Apparao (Intel Corporation); Animesh Trivedi (IBM Research Europe, Zurich)

BetterTogether: A Interference-Aware Framework for Fine-grained Software Pipelining on Heterogeneous SoCs
Yanwen Xu, Rithik Sharma, Zheyuan Chen, Shaan Mistry (University of California, Santa Cruz); Tyler Sorensen (Microsoft Research, University of California Santa Cruz)

PRISM: Processing-In-Memory Sparse MTTKRP for Tensor Decomposition Acceleration
Daniel Pacheco, Leonel Sousa (INESC-ID, Instituto Superior Técnico, Universidade de Lisboa); Aleksandar Ilic (INESC-ID & Instituto Superior Técnico)

ALPHA-PIM: Analysis of Linear Algebraic Processing for High-Performance Graph Applications on a Real Processing-In-Memory System
Marzieh Barkhordar, Alireza Tabatabaeian (Simon Fraser University); Mohammad Sadrosadati (ETH Zürich); Christina Giannoula (University of Toronto); Juan Gomez Luna (NVIDIA); Izzat El Hajj (American University of Beirut); Onur Mutlu (ETH Zurich); Alaa Alameldeen (Simon Fraser University)

A Comprehensive Analysis of Graph Neural Networks Training at Different Scales
Mostafa Eghbali Zarch, Michela Becchi (North Carolina State University, USA)

Keeping up with Large Language Models: A Holistic Methodology of Compute, Memory, Communication, and Cost Modeling
Wenzhe Guo, Joyjit Kundu, Uras Tos, Weijiang Kong, Giuliano Sisto, Timon Evenblij, Manu Perumkunnil (imec)

The Fake-Busy and True-Idle Problems of Running Graph Applications on Chiplet-Based Multi-cores
Rashid Aligholipour, Yuan Yao (Uppsala University)

DABench-LLM: Standardized and In-Depth Benchmarking of Post-Moore Dataflow AI Accelerators for LLMs
Ziyu Hu, Zhiqing Zhong (Stevens Institute of Technology); Weijian Zheng (Argonne National Laboratory); Zhijing Ye (Stevens Institute of Technology); Xuwei Tan, Xueru Zhang (The Ohio State University); Zhen Xie (Binghamton University); Rajkumar Kettimuthu (Argonne National Laboratory); Xiaodong Yu (Stevens Institute of Technology)

WANify: Gauging and Balancing Runtime WAN Bandwidth for Geo-distributed Data Analytics
Anshuman Das Mohapatra, Kwangsung Oh (University of Nebraska at Omaha)

ClusterSim: Modeling Thread Block Clusters in Hopper GPUs
Tim Lühnen (Technische Universität Hamburg); Jyotirman Behera, Devashree Tripathy (IIT Bhubaneswar); Sohan Lal (Technische Universität Hamburg)

EntoBench: A Benchmark Suite and Evaluation Framework for Insect-Scale Robotics
Derin Ozturk, Nick Cebry, Angela Cui, Hang Gao, Julie Villamil, Farrell Helbling, Christopher Batten (Cornell University)