leeloolee (Dokyoon)

liked 2 models 2 days ago

inclusionAI/ZwZ-4B

Image-Text-to-Text • 5B • Updated about 10 hours ago • 71 • 10

inclusionAI/Ring-2.5-1T

Text Generation • Updated 2 days ago • 1.24k • 165

updated a Space 2 days ago

README

🐠

upvoted an article 2 days ago

Article

OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments

+3

3 days ago

•

15

published a Space 3 days ago

README

🐠

upvoted a paper 3 days ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published 4 days ago • 173

upvoted a paper 4 days ago

Towards Pixel-Level VLM Perception via Simple Points Prediction

Paper • 2601.19228 • Published 19 days ago • 17

liked a Space 7 days ago

Open FinLLM Leaderboard

🥇

129

Explore LLM performance on financial benchmarks

reacted to raincandy-u's post with 🔥 11 days ago

Post

2927

Introducing Rain-v2: Democratizing LLM training on gaming GPUs! ⚡

Following Rain-100M, we’re scaling up. Rain-v2 features a larger training dataset.

We’ve published a comprehensive blog covering the end-to-end journey—from raw data collection to rigorous evaluation and safety testing.

HF Repo: 🤗 raincandy-u/Rain-v2

Blog: 📚
https://angelkawaii.xyz/2026/01/29/rain-v2/

Special thanks to the open-source community and the SmolLM2 team for their foundational work! 🚀

HuggingFaceTB
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model (2502.02737)

upvoted an article 16 days ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

19 days ago

•

54

reacted to Parveshiiii's post with 🔥 19 days ago

Post

1595

🚀 Wanna train your own AI Model or Tokenizer from scratch?

Building models isn’t just for big labs anymore — with the right data, compute, and workflow, you can create **custom AI models** and **tokenizers** tailored to any domain. Whether it’s NLP, domain‑specific datasets, or experimental architectures, training from scratch gives you full control over vocabulary, embeddings, and performance.

✨ Why train your own?
- Full control over vocabulary & tokenization
- Domain‑specific optimization (medical, legal, technical, etc.)
- Better performance on niche datasets
- Freedom to experiment with architectures

⚡ The best part?
- Tokenizer training (TikToken / BPE) can be done in **just 3 lines of code**.
- Model training runs smoothly on **Google Colab notebooks** — no expensive hardware required.

📂 Try out my work:
- 🔗 https://github.com/OE-Void/Tokenizer-from_scratch
- 🔗 https://github.com/OE-Void/GPT

upvoted a paper 19 days ago

VISTA-PATH: An interactive foundation model for pathology image segmentation and quantitative analysis in computational pathology

Paper • 2601.16451 • Published 23 days ago • 2

liked a dataset 19 days ago

Reubencf/2023_events

Viewer • Updated 21 days ago • 4.68k • 32 • 3

upvoted a paper 19 days ago

Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow

Paper • 2601.14243 • Published 25 days ago • 21

liked a model 25 days ago

lightonai/LightOnOCR-2-1B

Image-Text-to-Text • 1B • Updated 12 days ago • 208k • 517

upvoted a paper 26 days ago

Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders

Paper • 2601.10332 • Published about 1 month ago • 28

reacted to MonsterMMORPG's post with 👀 27 days ago

Post

4004

Compared Quality and Speed Difference (with CUDA 13 & Sage Attention) of BF16 vs GGUF Q8 vs FP8 Scaled vs NVFP4 for Z Image Turbo, FLUX Dev, FLUX SRPO, FLUX Kontext, FLUX 2 - Full 4K step by step tutorial also published

Full 4K tutorial : https://youtu.be/XDzspWgnzxI

Check above full 4K tutorial to learn more and see uncompressed original quality and size images

It was always wondered how much quality and speed difference exists between BF16, GGUF, FP8 Scaled and NVFP4 precisions. In this tutorial I have compared all these precision and quantization variants for both speed and quality. The results are pretty surprising. Moreover, we have developed and published NVFP4 model quant generator app and FP8 Scaled quant generator apps. The links of the apps are below if you want to use them. Furthermore, upgrading ComfyUI to CUDA 13 with properly compiled libraries is now very much recommended. We have observed some noticeable performance gains with CUDA 13. So for both SwarmUI and ComfyUI solo users, CUDA 13 ComfyUI is now recommended.