Oliver Pfaffel
OliP
AI & ML interests
None yet
Recent Activity
liked
a model
3 days ago
moonshotai/Kimi-K2-Thinking
liked
a model
7 days ago
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B
liked
a model
21 days ago
PaddlePaddle/PaddleOCR-VL
Organizations
2024 Papers of the year
LLM Deployment
-
Running273273
Llm Pricing
📊Display a React app with TypeScript
-
Running1.02k1.02k
Can You Run It? LLM version
🚀Calculate GPU requirements for running LLMs
-
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems
Paper • 2312.15234 • Published • 3 -
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
Paper • 2407.11062 • Published • 10
Long-Context
-
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
Paper • 2407.14057 • Published • 46 -
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
Paper • 2407.14482 • Published • 26 -
NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?
Paper • 2407.11963 • Published • 44
Special LMs <10B
Evaluation
-
Self-Taught Evaluators
Paper • 2408.02666 • Published • 30 -
Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries
Paper • 2409.12640 • Published • 2 -
openai/MMMLU
Viewer • Updated • 393k • 11k • 507 -
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Paper • 2409.16191 • Published • 42
Coding
-
SciCode: A Research Coding Benchmark Curated by Scientists
Paper • 2407.13168 • Published • 14 -
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Paper • 2407.16741 • Published • 73 -
CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases
Paper • 2408.03910 • Published • 18 -
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
Paper • 2408.07060 • Published • 42
Leading Leaderboards
-
Running on CPU Upgrade13.7k13.7k
Open LLM Leaderboard
🏆Track, rank and evaluate open LLMs and chatbots
-
Running on CPU Upgrade6.66k6.66k
MTEB Leaderboard
🥇Embedding Leaderboard
-
Running4.66k4.66k
LMArena Leaderboard
🏆Display LMArena Leaderboard
-
Running226226
BigCodeBench Leaderboard
🥇Explore and analyze code completion benchmarks
2023 (and before) Papers of the Year
-
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Paper • 2306.00989 • Published • 1 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 63 -
Scalable Diffusion Models with Transformers
Paper • 2212.09748 • Published • 18 -
Matryoshka Representation Learning
Paper • 2205.13147 • Published • 24
Vision-Language
-
EVLM: An Efficient Vision-Language Model for Visual Understanding
Paper • 2407.14177 • Published • 45 -
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
Paper • 2407.04172 • Published • 26 -
facebook/chameleon-7b
Image-Text-to-Text • 7B • Updated • 61.7k • 193 -
vidore/colpali
Visual Document Retrieval • Updated • 3.83k • 463
Audio
🌶️ Spaces
Applications
-
Integrating Large Language Models into a Tri-Modal Architecture for Automated Depression Classification
Paper • 2407.19340 • Published • 58 -
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Paper • 2408.02900 • Published • 30 -
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper • 2408.06292 • Published • 126
NewGen small LMs
Leading Leaderboards
-
Running on CPU Upgrade13.7k13.7k
Open LLM Leaderboard
🏆Track, rank and evaluate open LLMs and chatbots
-
Running on CPU Upgrade6.66k6.66k
MTEB Leaderboard
🥇Embedding Leaderboard
-
Running4.66k4.66k
LMArena Leaderboard
🏆Display LMArena Leaderboard
-
Running226226
BigCodeBench Leaderboard
🥇Explore and analyze code completion benchmarks
2024 Papers of the year
2023 (and before) Papers of the Year
-
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Paper • 2306.00989 • Published • 1 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 63 -
Scalable Diffusion Models with Transformers
Paper • 2212.09748 • Published • 18 -
Matryoshka Representation Learning
Paper • 2205.13147 • Published • 24
LLM Deployment
-
Running273273
Llm Pricing
📊Display a React app with TypeScript
-
Running1.02k1.02k
Can You Run It? LLM version
🚀Calculate GPU requirements for running LLMs
-
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems
Paper • 2312.15234 • Published • 3 -
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
Paper • 2407.11062 • Published • 10
Vision-Language
-
EVLM: An Efficient Vision-Language Model for Visual Understanding
Paper • 2407.14177 • Published • 45 -
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
Paper • 2407.04172 • Published • 26 -
facebook/chameleon-7b
Image-Text-to-Text • 7B • Updated • 61.7k • 193 -
vidore/colpali
Visual Document Retrieval • Updated • 3.83k • 463
Long-Context
-
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
Paper • 2407.14057 • Published • 46 -
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
Paper • 2407.14482 • Published • 26 -
NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?
Paper • 2407.11963 • Published • 44
Audio
Special LMs <10B
🌶️ Spaces
Evaluation
-
Self-Taught Evaluators
Paper • 2408.02666 • Published • 30 -
Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries
Paper • 2409.12640 • Published • 2 -
openai/MMMLU
Viewer • Updated • 393k • 11k • 507 -
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Paper • 2409.16191 • Published • 42
Applications
-
Integrating Large Language Models into a Tri-Modal Architecture for Automated Depression Classification
Paper • 2407.19340 • Published • 58 -
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Paper • 2408.02900 • Published • 30 -
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper • 2408.06292 • Published • 126
Coding
-
SciCode: A Research Coding Benchmark Curated by Scientists
Paper • 2407.13168 • Published • 14 -
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Paper • 2407.16741 • Published • 73 -
CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases
Paper • 2408.03910 • Published • 18 -
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
Paper • 2408.07060 • Published • 42