OrderLab Reading Group

Fall 2024

Wednesdays 2:00pm-3:30 pm, 3901 BBB

Coordinator: Wanning He

Description

The reading group organized by the OrderLab covers latest advances in the research of computer systems. Students will read and discuss recent papers in top systems conferences such as OSDI, SOSP, NSDI, EuroSys, and ASPLOS.

Each week, one student will present the paper and lead the discussion. Other students should read the paper to be presented before the seminar. This seminar is supposed to generate in-depth discussions. It is impossible to do so without reading the paper first.

The focus topics covered in the papers vary semester to semester. Example topics include fault-tolerance, reliability, verification, energy efficiency, and virtualization. The presenter decides which paper to present. In general, select the papers that are relevant to your research project first (i.e., depth-first). If you are not sure, check with me first before preparing the presentation. Also, try to avoid picking papers that have already been picked in the past (the past schedules are linked on the left-side menu).

The presentation announcements are sent via the mailing list orderlab-talk@umich.edu, which will also be used to generate follow-up discussions of the presented paper. Students who wish to sign up for the mailing list, please email ryanph@umich.edu.

Schedule

DatePresenterTitleConferenceMaterial
08/28/2024 Yuxuan Jiang

SuperBench: Improving Cloud AI Infrastructure Reliability with Proactive Validation (ATC'24)

Yifan Xiong, Yuting Jiang, Ziyue Yang, and Lei Qu, Microsoft Research; Guoshuai Zhao, Shuguang Liu, Dong Zhong, Boris Pinzur, Jie Zhang, Yang Wang, Jithin Jose, Hossein Pourreza, Jeff Baxter, Kushal Datta, Prabhat Ram, Luke Melton, and Joe Chau, Microsoft; Peng Cheng, Yongqiang Xiong, and Lidong Zhou, Microsoft Research

ATC '24 Paper
09/04/2024 Yiming Xiang

SquirrelFS: using the Rust compiler to check file-system crash consistency (OSDI'24)

Hayley LeBlanc, Nathan Taylor, James Bornholt, and Vijay Chidambaram, University of Texas at Austin

OSDI '24 Paper
09/11/2024 Yunchi Lu

nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training (OSDI'24)

Zhiqi Lin, University of Science and Technology of China; Youshan Miao, Quanlu Zhang, Fan Yang, and Yi Zhu, Microsoft Research; Cheng Li, University of Science and Technology of China; Saeed Maleki, xAI; Xu Cao, Ning Shang, Yilei Yang, Weijiang Xu, and Mao Yang, Microsoft Research; Lintao Zhang, BaseBit Technologies; Lidong Zhou, Microsoft Research

OSDI '24 Paper
09/18/2024 Yi Chen

Identifying On-/Off-CPU Bottlenecks Together with Blocked Samples (OSDI'24)

Minwoo Ahn and Jeongmin Han, Sungkyunkwan University; Youngjin Kwon, Korea Advanced Institute of Science and Technology (KAIST); Jinkyu Jeong, Yonsei University

OSDI '24 Paper
09/25/2024 Kevin Xue

High-throughput and Flexible Host Networking for Accelerated Computing (OSDI'24)

Athinagoras Skiadopoulos, Zhiqiang Xie, and Mark Zhao, Stanford University; Qizhe Cai and Saksham Agarwal, Cornell University; Jacob Adelmann, David Ahern, Carlo Contavalli, Michael Goldflam, Vitaly Mayatskikh, Raghu Raja, and Daniel Walton, Enfabrica; Rachit Agarwal, Cornell University; Shrijeet Mukherjee, Enfabrica; Christos Kozyrakis, Stanford University

OSDI '24 Paper
10/02/2024 Yuzhuo Jing

VeriSMo: A Verified Security Module for Confidential VMs (OSDI'24)

Ziqiao Zhou, Microsoft Research; Anjali, University of Wisconsin-Madison; Weiteng Chen, Microsoft Research; Sishuai Gong, Purdue University; Chris Hawblitzel and Weidong Cui, Microsoft Research

OSDI '24 Paper
10/09/2024 Wanning He

Flock: A Framework for Deploying On-Demand Distributed Trust (OSDI'24)

Darya Kaviani and Sijun Tan, UC Berkeley; Pravein Govindan Kannan, IBM Research; Raluca Ada Popa, UC Berkeley

OSDI '24 Paper
10/18/2024 Tony Pan

Efficient Reproduction of Fault-Induced Failures in Distributed Systems with Feedback-Driven Fault Injection (SOSP'24)

Jia Pan*, Haoze Wu*, Tanakorn Leesatapornwongsa, Suman Nath, Peng Huang

SOSP '24 Paper
10/25/2024 Tony Pan

Efficient Reproduction of Fault-Induced Failures in Distributed Systems with Feedback-Driven Fault Injection (SOSP'24)

Jia Pan*, Haoze Wu*, Tanakorn Leesatapornwongsa, Suman Nath, Peng Huang

SOSP '24 Paper
11/01/2024 Yi Chen

Yi's Project Presentation

11/08/2024 Kevin Xue

Kevin's Project Presentation