1 7 8

Alexander Rubinstein

arubique

AI & ML interests

None yet

Recent Activity

updated a model 6 days ago

arubique/DISCO-MMLU

upvoted a paper 7 days ago

SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale

upvoted a paper 8 days ago

Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models

View all activity

Organizations

None yet

updated a model 6 days ago

arubique/DISCO-MMLU

Updated 6 days ago • 38

upvoted a paper 7 days ago

SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale

Paper • 2602.23866 • Published 11 days ago • 80

upvoted a paper 8 days ago

Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models

Paper • 2602.24264 • Published 11 days ago • 14

liked 2 Spaces 30 days ago

Open LLM Leaderboard

🏆

13.9k

Track, rank and evaluate open LLMs and chatbots

MMLU-Pro Leaderboard

🥇

244

More advanced and challenging multi-task evaluation

updated a dataset about 1 month ago

arubique/flattened-MMLU

Viewer • Updated about 1 month ago • 14k • 52

published a dataset about 1 month ago

arubique/flattened-MMLU

Viewer • Updated about 1 month ago • 14k • 52

published a model about 1 month ago

arubique/DISCO-MMLU

Updated 6 days ago • 38

liked a model about 2 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 4.44M • • 4.56k

upvoted 2 papers 5 months ago

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Paper • 2510.09462 • Published Oct 10, 2025 • 6

DISCO: Diversifying Sample Condensation for Efficient Model Evaluation

Paper • 2510.07959 • Published Oct 9, 2025 • 15

commented a paper 5 months ago

DISCO: Diversifying Sample Condensation for Efficient Model Evaluation

Paper • 2510.07959 • Published Oct 9, 2025 • 15 •

upvoted a paper 8 months ago

On the rankability of visual embeddings

Paper • 2507.03683 • Published Jul 4, 2025 • 16

liked a dataset 9 months ago

xlangai/BRIGHT

Viewer • Updated Mar 1, 2025 • 1.35M • 26k • 63

liked a model 9 months ago

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 7.35M • • 5.55k

upvoted a paper 10 months ago

Diffusion Classifiers Understand Compositionality, but Conditions Apply

Paper • 2505.17955 • Published May 23, 2025 • 22

liked a dataset 10 months ago

Rowan/hellaswag

Viewer • Updated Jul 10, 2025 • 60k • 237k • 163

upvoted a paper 11 months ago

Are We Done with Object-Centric Learning?

Paper • 2504.07092 • Published Apr 9, 2025 • 6

authored a paper 11 months ago

Are We Done with Object-Centric Learning?

Paper • 2504.07092 • Published Apr 9, 2025 • 6

liked a model almost 3 years ago

CompVis/stable-diffusion-v1-4

Text-to-Image • Updated Aug 23, 2023 • 536k • 6.98k

Alexander Rubinstein

AI & ML interests

Recent Activity

Organizations

arubique's activity

Open LLM Leaderboard

MMLU-Pro Leaderboard