FAT5 (Flash Attention T5) report ⚡ — English version of the blog post introducing the FAT5 model
Qwen/Qwen3-VL-235B-A22B-Instruct — Image-Text-to-Text, 236B parameters
The Ultra-Scale Playbook 🌌 — The ultimate guide to training LLMs on large GPU clusters