-
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding
Paper • 2502.08946 • Published • 193 -
PRELUDE: A Benchmark Designed to Require Global Comprehension and Reasoning over Long Contexts
Paper • 2508.09848 • Published • 67 -
ttchungc/PRELUDE
Viewer • Updated • 1.16k • 47 • 17 -
ShunchiZhang/PhysiCo
Viewer • Updated • 600 • 36 • 6
Mo
BishopGorov
AI & ML interests
None yet
Organizations
None yet