Spaces:
Running
Running
Commit History
Merge pull request #6 from filbench/convert-to-uv
94a0ade
unverified
Lj Miranda
commited on
Update documentation
2b8a748
Fix dependencies and python versions
2b1eb04
Convert to uv
dfd4983
Add plots in the leaderboard (#5)
e2c374a
unverified
Lj Miranda
commited on
Fix misc citation
380beab
Move number of models
06de1cc
Update metadata (#4)
6829d60
unverified
Lj Miranda
commited on
Add new models
2a64d15
Always use zero-shot results
d00fdf6
Update models
40882d2
Update schema
523da05
Use rougeL scores instead
52583fc
Fix Llama 3.1 parameter size
902de67
Better computation of average
919a2b5
Add new models
9ac44b1
Always strip results after passing through cleaning func
9d4ad0a
Add more models in schema
90c7701
Pin pydantic version
eed7557
Attempt pinning gradio version
7e892aa
Add option to download file
15fe18d
Add indicators for each evals
7ffe204
Improve setup
d7a42ca
Update README.md
331f902
unverified
Lj Miranda
commited on
Add model details (#3)
04447ea
unverified
Lj Miranda
commited on
Add new benchmarks
b29ae1f
Add README metadata
0189dbf
Add initial leaderboard implementation (#2)
12f59da
unverified
Lj Miranda
commited on
Proper parsing of tasks
22ca5af
Move GH action location
d56ae29
Merge pull request #1 from filbench/add/parser
2a65969
unverified
Lj Miranda
commited on
Add GitHub workflow
f70bf3c
Add results parser
a38a5b4
Update README with instructions on how to run locally
6a4841e
Apply pre-commit and all fixes
b5b19aa
Apply formatter
814a536
Update README
e0bc74c
Update README.md
2b848e2
verified
gradio_leaderboard==0.0.13 (#15)
d81956b
Apply CSS to the model name column (#14)
fc32710
os.isfile --> os.path.isfile (#12)
abf5a88
Minimal edit for directory structure assumption. (#11)
1e9865f
Fix typo (#7)
35b7b80
Update src/display/utils.py
f867bb5
Update src/envs.py
30da67d
added leaderboard component to simplify main script
4103566
Clémentine
commited on