publications

publications by category in reverse chronological order.

2026

  1. MLSys
    FaaScale: Unlocking Fast LLM Scaling for Serverless Inference
    Minchen Yu, Rui Yang, Chaobo Jia, Zhaoyuan Su, Sheng Yao, Tingfeng Lan, Yuchen Yang, Yue Cheng, Wei Wang, Ao Wang, and Ruichuan Chen
    in the Proceedings of the 9th Annual Conference on Machine Learning and Systems, May 2026

2025

  1. SoCC
    ZipBatch: Multi-Tenant GPU Batching with Dual-Resource Regulation
    Haoxuan Yu, Sheng Yao, and Wei Wang
    in the Proceedings of the 2025 ACM Symposium on Cloud Computing, Nov 2025

2024

  1. SIGMOD
    WeBridge: Synthesizing Stored Procedures for Large-Scale Real-World Web Applications
    Gansen Hu, Zhaoguo Wang, Chuzhe Tang, Jiahuan Shen, Zhiyuan Dong, Sheng Yao, and Haibo Chen
    in the Proceedings of the 2024 ACM SIGMOD International Conference on Management of Data, Jun 2024