publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2026

  1. correct.png
    CORRECT: COndensed eRror RECognition via knowledge Transfer in multi-agent systems
    Yifan Yu, Moyan Li, Shaoyuan Xu, and 4 more authors
    In International Conference on Machine Learning, 2026
  2. xrpo.png
    XRPO: Pushing the limits of GRPO with Targeted Exploration and Exploitation
    Udbhav* Bamba, Minghao* Fang, Yifan* Yu, and 2 more authors
    In International Conference on Machine Learning, 2026
  3. idlm.png
    Introspective Diffusion Language Models
    Yifan* Yu, Yuqing* Jian, Junxiong Wang, and 12 more authors
    arXiv preprint arXiv:2604.11035, 2026
  4. OPPO: Accelerating PPO-based RLHF via Pipeline Overlap
    Kaizhuo Yan, Yingjie Yu, Yifan Yu, and 2 more authors
    In International Conference on Learning Representations, 2026

2025

  1. echolm.png
    IC-Cache: Efficient Large Language Model Serving via In-context Caching
    Yifan* Yu, Yu* Gan, Nikhil Sarda, and 7 more authors
    In Proceedings of the ACM SIGOPS 31st Symposium on Operating Systems Principles, Lotte Hotel World, Seoul, Republic of Korea, 2025

2024

  1. loftq.png
    Loftq: Lora-fine-tuning-aware quantization for large language models
    Yixiao* Li, Yifan* Yu, Chen Liang, and 4 more authors
    The Twelfth International Conference on Learning Representations, 2024

2023

  1. losparse.png
    Losparse: Structured compression of large language models based on low-rank and sparse approximation
    Yixiao* Li, Yifan* Yu, Qingru Zhang, and 4 more authors
    In International Conference on Machine Learning, 2023