Enhance model retrieval logic in read_evals.py to support dict-based model_args and improve org_and_model extraction; update .gitignore to exclude eval-results files. 5afbc08 djstrong committed 4 days ago
Make the ColumnContent dataclass immutable by passing frozen=True to the @dataclass decorator 0cd768c djstrong committed on Jan 29
Update requirements.txt to include setuptools and upgrade numpy to version 1.26.0; enhance check_validity.py to recognize 'Qwen3-' model names. 4ce3209 djstrong committed on Jan 29
Add calc_avg.py for average score calculation and refactor task retrieval in about.py b9262b0 djstrong committed on Mar 25, 2025