view article Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 Sep 4, 2025 • 272
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 50 items • Updated Dec 11, 2025 • 137
Searching for Best Practices in Retrieval-Augmented Generation Paper • 2407.01219 • Published Jul 1, 2024 • 11
Extending Context Window of Large Language Models via Positional Interpolation Paper • 2306.15595 • Published Jun 27, 2023 • 53