MastermindEval: A Simple But Scalable Reasoning Benchmark • Paper • 2503.05891 • Published Mar 7, 2025
Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models • Paper • 2504.14366 • Published Apr 19, 2025
Sample-Efficient Language Modeling with Linear Attention and Lightweight Enhancements • Paper • 2511.05560 • Published Nov 4, 2025
Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data • Paper • 2412.10121 • Published Dec 13, 2024
BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models • Paper • 2412.15978 • Published Dec 20, 2024
LM-PUB-QUIZ: A Comprehensive Framework for Zero-Shot Evaluation of Relational Knowledge in Language Models • Paper • 2408.15729 • Published Aug 28, 2024
BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models • Paper • 2404.04113 • Published Apr 5, 2024