MingHua Ma

Gezelligheid520

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 27 days ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published 28 days ago • 66

liked a model 2 months ago

Qwen/Qwen3-Next-80B-A3B-Instruct

Text Generation • 81B • Updated Sep 17 • 1.37M • 864

upvoted a collection 3 months ago

VisionLM

Collection

1749 items • Updated 4 days ago • 131

liked a model 3 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26 • 5.47M • 3.94k

upvoted a collection 5 months ago

Qwen3

Collection

84 items • Updated Aug 6 • 1.42k

upvoted 3 papers 6 months ago

SageAttention2++: A More Efficient Implementation of SageAttention2

Paper • 2505.21136 • Published May 27 • 45

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 131

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 185

liked a dataset 6 months ago

ChenShawn/DeepEyes-Datasets-47k

Preview • Updated May 22 • 385 • 15

upvoted 2 papers 6 months ago

Model Merging in Pre-training of Large Language Models

Paper • 2505.12082 • Published May 17 • 40

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19 • 82

authored a paper 7 months ago

CFSP: An Efficient Structured Pruning Framework for LLMs with Coarse-to-Fine Activation Information

Paper • 2409.13199 • Published Sep 20, 2024

upvoted a paper 7 months ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 41

authored a paper 7 months ago

UFO2: The Desktop AgentOS

Paper • 2504.14603 • Published Apr 20 • 29

upvoted 2 papers 7 months ago

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16 • 75

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 301

New activity in Gezelligheid520/Qwen-VL-7B-GRPO-2025-02-19-05-50-45 9 months ago

Upload folder using huggingface_hub

#2 opened 9 months ago by Gezelligheid520

Upload folder using huggingface_hub

#1 opened 9 months ago by Gezelligheid520

updated a model 9 months ago

Gezelligheid520/Qwen-VL-7B-GRPO-2025-02-19-05-50-45

8B • Updated Feb 22 • 1

New activity in Gezelligheid520/Qwen-VL-2B-GRPO-2025-02-19-05-36-42 9 months ago

Upload folder using huggingface_hub

#1 opened 9 months ago by Gezelligheid520
