Guille Pérez-Torró

guishe

AI & ML interests

Information Retrieval, Few-Shot Learning, Named Entity Recognition, Named Entity Disambiguation, Semantic Search, Aspect-based Sentiment Analysis

Recent Activity

upvoted an article 11 days ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

upvoted an article 13 days ago

Merge Large Language Models with mergekit

liked a Space 13 days ago

HuggingFaceTB/smol-training-playbook

View all activity

Organizations

None yet

upvoted an article 11 days ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7

•

249

upvoted an article 13 days ago

Article

Merge Large Language Models with mergekit

Jan 9, 2024

•

145

upvoted an article 21 days ago

Article

Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text

28 days ago

•

upvoted an article 2 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4

•

256

upvoted a collection 3 months ago

gpt-oss

Collection

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 382

upvoted an article 5 months ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Oct 14, 2024

•

upvoted a collection 7 months ago

Qwen3

Collection

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 17 days ago • 233

upvoted an article 7 months ago

Article

MIEB: The Benchmark That Stress-Tests Image-Text Embeddings Like Never Before

Apr 24

•

upvoted a collection 7 months ago

Unsloth Dynamic 2.0 Quants

Collection

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 54 items • Updated 10 days ago • 247

upvoted a paper 8 months ago

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published Jan 27 • 35

upvoted an article 8 months ago

Article

Judge Arena: Benchmarking LLMs as Evaluators

Nov 19, 2024

•

upvoted 3 papers 8 months ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 117

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 141

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 113

upvoted an article 8 months ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

Feb 4

•

121

upvoted 2 collections 8 months ago

reranking series v2

Collection

V2 crispy rerank series • 3 items • Updated Jun 25 • 24

Unsloth 4-bit Dynamic Quants

Collection

Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 28 items • Updated 17 days ago • 89

upvoted a paper 9 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 95

upvoted a collection 9 months ago

4bit Instruct Models

Collection

18 items • Updated 17 days ago • 32

upvoted an article 9 months ago

Article

Tutorial: Quantizing Llama 3+ Models for Efficient Deployment

Dec 15, 2024

•

Guille Pérez-Torró

AI & ML interests

Recent Activity

Organizations

guishe's activity

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Merge Large Language Models with mergekit

Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text

Welcome EmbeddingGemma, Google's new efficient embedding model

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

MIEB: The Benchmark That Stress-Tests Image-Text Embeddings Like Never Before

Judge Arena: Benchmarking LLMs as Evaluators

DABStep: Data Agent Benchmark for Multi-step Reasoning

Tutorial: Quantizing Llama 3+ Models for Efficient Deployment