SeasonalFall84 commited on
Commit
8ac5fb2
·
verified ·
1 Parent(s): 51c1f0b

Add Artificial Analysis evaluations for olmo-3-7b-instruct

Browse files

This commit adds structured evaluation results to the model card. The results are formatted using the model-index specification and will be displayed in the model card's evaluation widget.

Files changed (1) hide show
  1. README.md +51 -0
README.md CHANGED
@@ -6,6 +6,57 @@ language:
6
  library_name: transformers
7
  datasets:
8
  - allenai/Dolci-Instruct-RL-7B
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  ---
10
 
11
  ## Model Details
 
6
  library_name: transformers
7
  datasets:
8
  - allenai/Dolci-Instruct-RL-7B
9
+ model-index:
10
+ - name: Olmo-3-7B-Instruct
11
+ results:
12
+ - task:
13
+ type: evaluation
14
+ dataset:
15
+ name: Artificial Analysis Benchmarks
16
+ type: artificial_analysis
17
+ metrics:
18
+ - name: Artificial Analysis Intelligence Index
19
+ type: artificial_analysis_intelligence_index
20
+ value: 22.2
21
+ - name: Artificial Analysis Coding Index
22
+ type: artificial_analysis_coding_index
23
+ value: 12.3
24
+ - name: Artificial Analysis Math Index
25
+ type: artificial_analysis_math_index
26
+ value: 41.3
27
+ - name: Mmlu Pro
28
+ type: mmlu_pro
29
+ value: 0.522
30
+ - name: Gpqa
31
+ type: gpqa
32
+ value: 0.4
33
+ - name: Hle
34
+ type: hle
35
+ value: 0.058
36
+ - name: Livecodebench
37
+ type: livecodebench
38
+ value: 0.266
39
+ - name: Scicode
40
+ type: scicode
41
+ value: 0.103
42
+ - name: Aime 25
43
+ type: aime_25
44
+ value: 0.413
45
+ - name: Ifbench
46
+ type: ifbench
47
+ value: 0.328
48
+ - name: Lcr
49
+ type: lcr
50
+ value: 0
51
+ - name: Terminalbench Hard
52
+ type: terminalbench_hard
53
+ value: 0
54
+ - name: Tau2
55
+ type: tau2
56
+ value: 0.126
57
+ source:
58
+ name: Artificial Analysis API
59
+ url: https://artificialanalysis.ai
60
  ---
61
 
62
  ## Model Details