- The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding
  Paper • 2502.08946 • Published • 191
- PRELUDE: A Benchmark Designed to Require Global Comprehension and Reasoning over Long Contexts
  Paper • 2508.09848 • Published • 71
- ttchungc/PRELUDE
  Viewer • Updated • 1.16k • 67 • 17
- ShunchiZhang/PhysiCo
  Viewer • Updated • 600 • 64 • 6
Mo
BishopGorov
AI & ML interests
None yet
Recent Activity
upvoted a paper about 6 hours ago
Scaling Open-Ended Reasoning to Predict the Future
upvoted a paper about 6 hours ago
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents
authored a paper 1 day ago
Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding
Organizations
None yet