31 8 32

Haoxiang Wang

Haoxiang-Wang

https://haoxiang-wang.github.io/

AI & ML interests

Machine Learning (Transfer Learning, OOD Generalization, Domain Adaptation, Meta-Learning)

Recent Activity

new activity 11 days ago

HuggingFaceM4/FineVision:All images of the tqt_dqa subset are of 224x224 documents

upvoted a paper 3 months ago

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

updated a model 6 months ago

nvidia/NFT-32B

View all activity

Organizations

New activity in HuggingFaceM4/FineVision 11 days ago

All images of the tqt_dqa subset are of 224x224 documents

#31 opened 11 days ago by

Haoxiang-Wang

upvoted a paper 3 months ago

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

Paper • 2510.04996 • Published Oct 6, 2025 • 15

updated a model 6 months ago

nvidia/NFT-32B

Text Generation • 33B • Updated Jul 15, 2025 • 64 • • 6

published 2 models 6 months ago

nvidia/NFT-32B

Text Generation • 33B • Updated Jul 15, 2025 • 64 • • 6

nvidia/NFT-7B

Text Generation • 8B • Updated Jul 15, 2025 • 55 • 2

updated a model 6 months ago

nvidia/NFT-7B

Text Generation • 8B • Updated Jul 15, 2025 • 55 • 2

upvoted a paper 7 months ago

Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

Paper • 2505.18116 • Published May 23, 2025 • 4

commented a paper 7 months ago

Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

Paper • 2505.18116 • Published May 23, 2025 • 4 •

upvoted 2 papers 10 months ago

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published Mar 18, 2025 • 50

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26, 2025 • 82

New activity in nvidia/Cosmos-1.0-Autoregressive-4B 12 months ago

access restriction

#3 opened 12 months ago by

qyx915915

Access restrictions

#2 opened 12 months ago by

fximax

updated 8 models 12 months ago

Haoxiang Wang

AI & ML interests

Recent Activity

Organizations

Haoxiang-Wang's activity

All images of the tqt_dqa subset are of 224x224 documents

access restriction

Access restrictions