Abreu Magalhães

Hildeberto

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

google/embeddinggemma-300m

upvoted a paper 28 days ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published 29 days ago • 173

upvoted 4 papers about 1 month ago

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 78

ACON: Optimizing Context Compression for Long-horizon LLM Agents

Paper • 2510.00615 • Published Oct 1 • 32

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1 • 88

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28 • 170

upvoted 2 papers 2 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 190

NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings

Paper • 2509.04011 • Published Sep 4 • 28

upvoted 5 papers 3 months ago

AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications

Paper • 2508.16279 • Published Aug 22 • 52

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22 • 154

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

Paper • 2508.09834 • Published Aug 13 • 53

ComoRAG: A Cognitive-Inspired Memory-Organized RAG for Stateful Long Narrative Reasoning

Paper • 2508.10419 • Published Aug 14 • 73

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19 • 48

upvoted a collection 3 months ago

SSRL

6 items • Updated Aug 18 • 2

upvoted a paper 4 months ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 258

upvoted 2 papers 5 months ago

OAgents: An Empirical Study of Building Effective Agents

Paper • 2506.15741 • Published Jun 17 • 35

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262

upvoted a paper 6 months ago

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8 • 185

upvoted 2 papers 8 months ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 117

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published Feb 26 • 38

upvoted a paper 9 months ago

Jailbreaking with Universal Multi-Prompts

Paper • 2502.01154 • Published Feb 3 • 10