publications

publications by categories in reversed chronological order.

2025

  1. SoCC
    ZipBatch: Multi-Tenant GPU Batching with Dual-Resource Regulation
    Haoxuan Yu, Sheng Yao, and Wei Wang
    in the Proceedings of ACM Symposium on Cloud Computing, Nov 2025
  2. preprint
    λScale: Enabling Fast Scaling for Serverless Large Language Model Inference
    Minchen Yu, Rui Yang, Chaobo Jia, Zhaoyuan Su, Sheng Yao, Tingfeng Lan, Yuchen Yang, Yue Cheng, Wei Wang, Ao Wang, and Ruichuan Chen
    Feb 2025

2024

  1. SIGMOD
    WeBridge: Synthesizing Stored Procedures for Large-Scale Real-World Web Applications
    Gansen Hu, Zhaoguo Wang, Chuzhe Tang, Jiahuan Shen, Zhiyuan Dong, Sheng Yao, and Haibo Chen
    in the Proceedings of the 2024 ACM SIGMOD International Conference on Management of Data, Jun 2024