publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2025
- ArXivSEAL: Scaling to Emphasize Attention for Long-Context Retrieval2025
- CALCost-effective Extension of DRAM-PIM for Group-wise LLM QuantizationIEEE Computer Architecture Letters, 2025
- WACV OralPTQ4VM: Post-Training Quantization for Visual MambaIn Proceedings of the Winter Conference on Applications of Computer Vision (WACV), Feb 2025
2024
- EMNLP FindingsQEFT: Quantization for Efficient Fine-Tuning of LLMsIn Findings of the Association for Computational Linguistics: EMNLP 2024, Feb 2024
- AAAI OralOwq: Outlier-aware weight quantization for efficient fine-tuning and inference of large language modelsIn Proceedings of the AAAI Conference on Artificial Intelligence, Feb 2024
2023
- ICCVINSTA-BNN: Binary neural network with instance-aware thresholdIn Proceedings of the IEEE/CVF International Conference on Computer Vision, Feb 2023
2021
- CVPRImproving accuracy of binary neural networks using unbalanced activation distributionIn Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Feb 2021