Publications

* denotes equal contribution.

2026

  1. EuroSys
    Learn-to-Probe: Achieving Signal Distinguishability in Learning-based Congestion Control
    Han Tian , Wenbo Li, Junxue ZhangXudong Liao, Decang Sun , Donghui Chen , Bin Huang, Wenxue Li , Yong Wang, and Kai Chen
    In Proceedings of the 21th ACM European Conference on Computer Systems (EuroSys 2026) , 2026
  2. EuroSys
    MFS: An Efficient Model Family Serving System for LLMs
    Yunxuan Zhang, Hao WangHan Tian, Liu Yang, Xudong LiaoWenxue Li, Ping Yin, Bowen Liu, and Kai Chen
    In Proceedings of the 21th ACM European Conference on Computer Systems (EuroSys 2026) , 2026

2025

  1. SIGCOMM
    MixNet: A Runtime Reconfigurable Optical-Electrical Fabric for Distributed Mixture-of-Experts Training
    Xudong Liao, Yijun Sun, Han TianXinchen WanYilun JinZilong WangZhenghang RenXinyang HuangWenxue Li, Kin Fai Tse, Zhizhen Zhong, Guyue Liu , Ying Zhang, Xiaofeng Ye , Yiming Zhang, and Kai Chen
    In Proceedings of the 2025 ACM SIGCOMM Conference (SIGCOMM 2025) , 2025
  2. ATC
    Towards Optimal Rack-scale μs-level CPU Scheduling through In-Network Workload Shaping
    Xudong LiaoHan TianXinchen WanChaoliang ZengHao WangJunxue Zhang, Mengyu Ma, Guyue Liu, and Kai Chen
    In 2025 USENIX Annual Technical Conference (ATC 2025) , 2025
  3. OSDI
    Enabling Efficient GPU Communication over Multiple NICs with FuseLink
    Zhenghang Ren , Yuxuan Li , Zilong WangXinyang HuangWenxue Li, Kaiqiang Xu, Xudong Liao, Yijun Sun, Bowen Liu, Han TianJunxue Zhang , Mingfei Wang, Zhizhen Zhong, Guyue Liu , Ying Zhang, and Kai Chen
    In Proceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation (OSDI 2025) , 2025
  4. EuroSys
    Achieving Fairness Generalizability for Learning-based Congestion Control with Jury
    Han TianXudong Liao, Decang Sun, Chaoliang ZengYilun JinJunxue ZhangXinchen WanZilong Wang , Yong Wang, and Kai Chen
    In Proceedings of the 20th ACM European Conference on Computer Systems (EuroSys 2025) , 2025
  5. INFOCOM
    A Generic and Efficient Communication Framework for Message-level In-Network Computing
    Xinchen Wan , Luyang Li, Han TianXudong LiaoXinyang HuangChaoliang ZengZilong Wang, Xinyu Yang, Ke Cheng, Qingsong Ning, Guyue Liu, Layong Luo, and Kai Chen
    In Proceedings of the IEEE International Conference on Computer Communications (INFOCOM 2025) , 2025
  6. ASPLOS
    Design and Operation of Shared Machine Learning Clusters on Campus
    Kaiqiang Xu, Decang Sun, Hao WangZhenghang RenXinchen WanXudong LiaoZilong WangJunxue Zhang, and Kai Chen
    In Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2025) , 2025

2024

  1. EuroSys
    Astraea: Towards Fair and Efficient Learning-based Congestion Control
    Xudong Liao*Han Tian*Chaoliang ZengXinchen Wan, and Kai Chen
    In Proceedings of the 19th ACM European Conference on Computer Systems (EuroSys 2024) , 2024
  2. NSDI
    Accelerating Neural Recommendation Training with Embedding Scheduling
    Chaoliang Zeng*Xudong Liao*, Xiaodian Cheng, Han TianXinchen WanHao Wang, and Kai Chen
    In Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI 2024) , 2024

2023

  1. SIGMOD
    Scalable and Efficient Full-Graph GNN Training for Large Graphs
    Xinchen Wan, Kaiqiang Xu, Xudong LiaoYilun JinKai Chen , and Xin Jin
    In Proceedings of the ACM on Management of Data (SIGMOD 2023) , 2023
  2. TON
    Efficient DRL-Based Congestion Control With Ultra-Low Overhead
    Han Tian*Xudong Liao*Chaoliang Zeng, Decang Sun, Junxue Zhang, and Kai Chen
    IEEE/ACM Transactions on Networking, 2023

2022

  1. CoNEXT
    Spine: An Efficient DRL-Based Congestion Control with Ultra-Low Overhead
    Han Tian*Xudong Liao*Chaoliang ZengJunxue Zhang, and Kai Chen
    In Proceedings of the 18th International Conference on Emerging Networking EXperiments and Technologies (CoNEXT 2022) , 2022
  2. EuroSys
    Multi-Objective Congestion Control
    Yiqing Ma, Han TianXudong LiaoJunxue Zhang , Weiyan Wang, Kai Chen , and Xin Jin
    In Proceedings of the 17th European Conference on Computer Systems (EuroSys 2022) , 2022

2021

  1. ArXiv
    Tacc: A full-stack cloud computing infrastructure for machine learning tasks
    Kaiqiang Xu, Xinchen WanHao WangZhenghang RenXudong Liao, Decang Sun, Chaoliang Zeng, and Kai Chen
    arXiv preprint arXiv:2110.01556, 2021
  2. Book
    Datacenter Traffic Optimization with Deep Reinforcement Learning
    Li Chen, Justinas Lingys, Kai Chen, and Xudong Liao
    2021