publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2024
- Loftq: Lora-fine-tuning-aware quantization for large language modelsThe Twelfth International Conference on Learning Representations, 2024
2023
- Losparse: Structured compression of large language models based on low-rank and sparse approximationIn International Conference on Machine Learning, 2023