fanwanx

FANTKwan

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper about 1 month ago

Agent Learning via Early Experience

upvoted a paper about 2 months ago

LongCodeZip: Compress Long Context for Code Language Models

upvoted a paper about 2 months ago

SWE-QA: Can Language Models Answer Repository-level Code Questions?

Organizations

None yet

upvoted a paper about 1 month ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 263

upvoted 3 papers about 2 months ago

upvoted a paper 2 months ago

MachineLearningLM: Continued Pretraining Language Models on Millions of Synthetic Tabular Prediction Tasks Scales In-Context ML

Paper • 2509.06806 • Published Sep 8 • 63

liked a model 2 months ago

MachineLearningLM/MachineLearningLM-7B-v1

Text Generation • 8B • Updated Oct 1 • 49 • 32

upvoted 3 papers 3 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19 • 118

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25 • 206

upvoted 2 papers 6 months ago

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published May 29 • 94

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Paper • 2505.22651 • Published May 28 • 50

upvoted 3 papers 7 months ago

The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs

Paper • 2504.17768 • Published Apr 24 • 14

Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published Apr 21 • 158

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17 • 94

upvoted 3 papers 8 months ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7 • 123

SafeArena: Evaluating the Safety of Autonomous Web Agents

Paper • 2503.04957 • Published Mar 6 • 21

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published Mar 10 • 68

upvoted a paper 9 months ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13 • 193

liked 2 models 9 months ago

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • 33B • Updated Jan 12 • 186k • 1.95k

Zyphra/Zonos-v0.1-transformer

Text-to-Speech • Updated Jun 3 • 16.4k • 418
