ModernVBERT: Towards Smaller Visual Document Retrievers Paper • 2510.01149 • Published Oct 1, 2025 • 30
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1, 2025 • 80
ViDoRe Benchmark V2: Raising the Bar for Visual Retrieval Paper • 2505.17166 • Published May 22, 2025
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published Mar 7, 2025 • 80
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 43
ColPali: Efficient Document Retrieval with Vision Language Models Paper • 2407.01449 • Published Jun 27, 2024 • 50
Towards Trustworthy Reranking: A Simple yet Effective Abstention Mechanism Paper • 2402.12997 • Published Feb 20, 2024 • 9
CroissantLLM: A Truly Bilingual French-English Language Model Paper • 2402.00786 • Published Feb 1, 2024 • 26
Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications Paper • 2310.14103 • Published Oct 21, 2023 • 1