I’m Chunyu Xue, a fourth-year Direct PhD Candidate in Emerging Parallel Computing Center (EPCC) from Shanghai Jiao Tong University (SJTU), advised by Prof. Quan Chen. My research interests lie in building efficient and scalable systems for LLM and MultiModal training, fine-tuning, and cluster-level scheduling. Currently, I’m working as a research intern in ByteDance Seed (Training Infrastructure). I worked as an engineering intern in Microsoft Cloud+AI. I received my B.S. in Computer Science and Technology from SJTU. Feel free to reach out if you are interested in potential collaboration!

Education

  • 2022.09 - Now, Shanghai Jiao Tong University Ph.D. in Computer Science
  • 2018.09 - 2022.06, Shanghai Jiao Tong University B.S. in Computer Science (Zhiyuan Honors Program)

Experiences

  • 2025.03 - Now, ByteDance Seed (Training Infrastructure) Research Intern
  • 2021.06 - 2021.09, Microsoft (Cloud+AI) Software Engineer Intern

Publications

Published

  • EuroSys 2026 #Chunyu Xue, Weihao Cui, Quan Chen, Chen Chen, Han Zhao, Shulai Zhang, Linmei Wang, Yan Li, Limin Xiao, Weifeng Zhang, Jing Yang, Bingsheng He, Minyi Guo. “Arena: Efficiently Training Large Models via Dynamic Scheduling and Adaptive Parallelism Co-Design”. 21st ACM European Conference on Computer Systems, Eurosys’26 (CCF-A). [Paper] [Code]

  • EuroSys 2026 #Chunyu Xue, Yangrui Chen, Jianyu Jiang, Ningxin Zheng, Junda Feng, Jingji Chen, Shixiong Zhao, Shen Yan, Yi Lin, Lei Shi, Zanbo Wang, Lishu Luo, Faming Wu, Haibin Lin, Yanghua Peng, Xin Liu, Quan Chen. “MegaScale-Omni: A Hyper-Scale, Workload-Resilient System for MultiModal LLM Training in Production”. 21st ACM European Conference on Computer Systems, Eurosys’26 (CCF-A). [Paper] [Code]

  • NSDI 2026 #Chunyu Xue, Yi Pan, Weihao Cui, Quan Chen, Shulai Zhang, Bingsheng He, Minyi Guo. “MuxTune: Efficient Multi-Task LLM Fine-Tuning in Multi-Tenant Datacenters via Spatial-Temporal Backbone Multiplexing”. 23rd USENIX Symposium on Networked Systems Design and Implementation, NSDI’26 (CCF-A). [Paper] [Arxiv] [Code]

  • EuroSys 2026 Yuxuan Wang, Yanbo Wang, Chen Chen, #Chunyu Xue, Qizhen Weng, Yin Chen, Zeren Li, Xuqi Zhu, Yongqiang Yang, Quan Chen, Minyi Guo. “Suika: Efficient and High-quality Re-scheduling of 3D-parallelized LLM Training Jobs in Shared Clusters”. 21st ACM European Conference on Computer Systems, Eurosys’26 (CCF-A). [Paper]

  • EuroSys 2025 Shulai Zhang, Quan Chen, Weihao Cui, Han Zhao, #Chunyu Xue, Zhen Zheng, Wei Lin, Minyi Guo. “Improving GPU Sharing Performance through Adaptive Bubbleless Spatial-Temporal Sharing”. 20th ACM European Conference on Computer Systems, Eurosys’25 (CCF-A). [Paper]

  • TACO 2025 Pengyu Yang, Weihao Cui, #Chunyu Xue, Han Zhao, Chen Chen, Quan Chen, Jing Yang, Minyi Guo. “Taming Flexible Job Packing in Deep Learning Training Clusters”. ACM Transactions on Architecture and Code Optimization, TACO’25 (CCF-A). [Paper]

Preprint

Projects

  • 2024.12 - Now, Efficient and Elastic Training System for Large Models, Lenovo.