SPADE: Synthesizing Assertions for Large Language Model Pipelines Paper • 2401.03038 • Published Jan 5, 2024 • 2
Accelerating Retrieval-Augmented Language Model Serving with Speculation Paper • 2401.14021 • Published Jan 25, 2024 • 2