SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models Paper • 2405.14917 • Published May 23, 2024 • 1