2 64 172

wangrui

varuy322

varuy322

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

liked a dataset 4 days ago

m-a-p/LPFQA

liked a model 6 days ago

HuggingFaceFW/finepdfs_edu_classifier_eng_Latn

View all activity

Organizations

None yet

upvoted a paper 4 days ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 141

liked a dataset 4 days ago

m-a-p/LPFQA

Viewer • Updated 8 days ago • 502 • 96 • 3

liked a model 6 days ago

HuggingFaceFW/finepdfs_edu_classifier_eng_Latn

0.4B • Updated 6 days ago • 22 • 2

liked a dataset 7 days ago

EssentialAI/essential-web-v1.0

Preview • Updated Oct 2 • 29.2k • 206

upvoted an article 11 days ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

15 days ago

•

liked a dataset 12 days ago

nvidia/Nemotron-VLM-Dataset-v2

Viewer • Updated 12 days ago • 4.58M • 9.07k • 63

liked a dataset 21 days ago

open-r1/codeforces-cots

Viewer • Updated Mar 28 • 254k • 5.29k • 193

upvoted a paper 29 days ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14 • 107

liked a dataset 29 days ago

HuggingFaceFW/finepdfs

Viewer • Updated 6 days ago • 476M • 57.7k • 669

upvoted a collection about 1 month ago

Ferret

Collection

A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret • 7 items • Updated 27 days ago • 1

liked a model about 1 month ago

nvidia/omni-embed-nemotron-3b

Feature Extraction • 5B • Updated Oct 9 • 41.6k • 68

liked a dataset about 1 month ago

OpenGVLab/InternVL-Chat-V1-2-SFT-Data

Viewer • Updated Sep 20, 2024 • 573k • 916 • 29

liked 2 models about 1 month ago

Qwen/Qwen3-VL-4B-Instruct

Image-Text-to-Text • 4B • Updated Oct 15 • 628k • 233

internlm/CapRL-3B

Image-Text-to-Text • 4B • Updated 27 days ago • 502 • 44

liked a dataset about 1 month ago

zai-org/DeepDive

Viewer • Updated Sep 17 • 4.11k • 1.05k • 15

liked a model about 2 months ago

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22 • 270k • 718

liked a dataset about 2 months ago

google/simpleqa-verified

Viewer • Updated Sep 22 • 1k • 1.02k • 21

upvoted 2 papers about 2 months ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 50

ARE: Scaling Up Agent Environments and Evaluations

Paper • 2509.17158 • Published Sep 21 • 35

upvoted a collection about 2 months ago

ZeroSearch_Policy_Google_V2