I’m Chunyu Xue (薛春宇), a fourth-year PhD student at the Emerging Parallel Computing Center (EPCC), Shanghai Jiao Tong University (SJTU), advised by Prof. Quan Chen. My research interests lie in building efficient and scalable systems for LLM and multimodal pretraining, supervised fine-tuning, post-training, and cluster-level scheduling. Currently, I’m working as a research intern at Moonshot AI (Kimi) (RL Infrastructure). Previously, I had a wonderful experience as a research intern at ByteDance Seed (Training Infrastructure), and I worked as an engineering intern at Microsoft Cloud+AI. I received my B.S. in Computer Science and Technology from SJTU. Feel free to reach out if you are interested in potential collaboration!

🔥 News

  • 2026.05, Glad to join the Moonshot AI (Kimi) RL Infra team as a research intern 🥳, solving ambitious moonshot problems together that will lead humanity to AGI! Had a wonderful experience on the Seed team~
  • 2026.05, Give a talk at NSDI’26, Renton, WA, USA.
  • 2026.04, Give two talks at EuroSys’26, Edinburgh, Scotland, UK.
  • 2026.01, Three papers (Arena, MegaScale-Omni, and Suika) have been accepted to EuroSys’26!
  • 2025.12, MuxTune has been accepted to NSDI’26!
  • 2025.10, Serve as a reviewer for Concurrency and Computation: Practice and Experience.
  • 2025.03, Glad to join the ByteDance Seed Training Infra team as a research intern 🥳, pushing the boundaries of AI together!

🎓 Education

  • 2022.09 - Now, Shanghai Jiao Tong University, Ph.D. in Computer Science
  • 2018.09 - 2022.06, Shanghai Jiao Tong University, B.S. in Computer Science (Zhiyuan Honors Program)

💼 Experiences

  • 2026.05 - Now, Moonshot AI (Kimi) (RL Infrastructure), Research Intern
  • 2025.03 - 2026.05, ByteDance Seed (Training Infrastructure), Research Intern
  • 2021.06 - 2021.09, Microsoft (Cloud+AI), Software Engineer Intern

📖 Publications

Published

  • EuroSys 2026 #Chunyu Xue, Weihao Cui, Quan Chen, Chen Chen, Han Zhao, Shulai Zhang, Linmei Wang, Yan Li, Limin Xiao, Weifeng Zhang, Jing Yang, Bingsheng He, Minyi Guo. “Arena: Efficiently Training Large Models via Dynamic Scheduling and Adaptive Parallelism Co-Design”. 21st ACM European Conference on Computer Systems, EuroSys’26 (CCF-A). [Paper] [Arxiv] [Code]

  • EuroSys 2026 #Chunyu Xue, Yangrui Chen, Jianyu Jiang, Ningxin Zheng, Junda Feng, Jingji Chen, Shixiong Zhao, Shen Yan, Yi Lin, Lei Shi, Zanbo Wang, Lishu Luo, Faming Wu, Haibin Lin, Yanghua Peng, Xin Liu, Quan Chen. “MegaScale-Omni: A Hyper-Scale, Workload-Resilient System for MultiModal LLM Training in Production”. 21st ACM European Conference on Computer Systems, EuroSys’26 (CCF-A). [Paper] [Code]

  • NSDI 2026 #Chunyu Xue, Yi Pan, Weihao Cui, Quan Chen, Shulai Zhang, Bingsheng He, Minyi Guo. “MuxTune: Efficient Multi-Task LLM Fine-Tuning in Multi-Tenant Datacenters via Spatial-Temporal Backbone Multiplexing”. 23rd USENIX Symposium on Networked Systems Design and Implementation, NSDI’26 (CCF-A). [Paper] [Arxiv] [Code]

  • EuroSys 2026 Yuxuan Wang, Yanbo Wang, Chen Chen, #Chunyu Xue, Qizhen Weng, Yin Chen, Zeren Li, Xuqi Zhu, Yongqiang Yang, Quan Chen, Minyi Guo. “Suika: Efficient and High-quality Re-scheduling of 3D-parallelized LLM Training Jobs in Shared Clusters”. 21st ACM European Conference on Computer Systems, EuroSys’26 (CCF-A). [Paper]

  • EuroSys 2025 Shulai Zhang, Quan Chen, Weihao Cui, Han Zhao, #Chunyu Xue, Zhen Zheng, Wei Lin, Minyi Guo. “Improving GPU Sharing Performance through Adaptive Bubbleless Spatial-Temporal Sharing”. 20th ACM European Conference on Computer Systems, EuroSys’25 (CCF-A). [Paper]

  • TACO 2025 Pengyu Yang, Weihao Cui, #Chunyu Xue, Han Zhao, Chen Chen, Quan Chen, Jing Yang, Minyi Guo. “Taming Flexible Job Packing in Deep Learning Training Clusters”. ACM Transactions on Architecture and Code Optimization, TACO’25 (CCF-A). [Paper]

Preprint

🔨 Projects

  • 2026.03 - Now, ViT-USM Encoder Unified Load Balancing and Dynamic Sequence Parallelism in Multimodal Pretraining. ByteDance Seed.
  • 2025.12 - Now, Efficient Checkpoint-Restore Techniques for Large Model Elastic Training. Lenovo.
  • 2024.12 - 2025.12, Efficient and Elastic Training System for Large Models. Lenovo.

🎉 Awards

  • Outstanding Graduate of Shanghai Jiao Tong University, 2022.06
  • Zhiyuan Honors Degree, Shanghai Jiao Tong University, 2022.06
  • SJTU Undergraduate Excellence Scholarship, 2019-2022
  • SJTU Zhiyuan Honors Scholarship, 2018-2022
  • SJTU Xiaomi Scholarship, 2021.12