Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 13 days ago • 10
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models Paper • 2405.05417 • Published May 8, 2024 • 1