K2V2: Optimizing KV Cache Memory Management via Channel-Specific Mixed-Precision Quantization
Li, J., et al. (2025). "K2V2: Optimizing KV Cache Memory Management via Channel-Specific Mixed-Precision Quantization." MLSys 2026.
Li, J., et al. (2025). "K2V2: Optimizing KV Cache Memory Management via Channel-Specific Mixed-Precision Quantization." MLSys 2026.
Li, J., et al. (2025). "AI Agent Protocol Benchmark: A Unified Framework for Evaluating Multi-Agent Communication." arXiv preprint arXiv:2510.17149.