Gurumurthi V Ramanan

GVR

https://surasys.co

Recent Activity

liked a model 3 days ago

nvidia/Nemotron-Cascade-8B-Intermediate-ckpts

upvoted a paper about 24 hours ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published 6 days ago • 28

upvoted 3 articles 3 days ago

Efficient MultiModal Data Pipeline

Article • Jul 8

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Article • May 21 • 245

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Article • Sep 11 • 176

upvoted an article 8 days ago

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Article • 12 days ago

upvoted 2 papers 15 days ago

A Survey of Vibe Coding with Large Language Models

Paper • 2510.12399 • Published Oct 14 • 49

Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

Paper • 2512.10534 • Published 18 days ago • 31

upvoted a paper 16 days ago

One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation

Paper • 2512.07829 • Published 21 days ago • 21

upvoted an article 16 days ago

MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier

Article • 17 days ago

upvoted 2 papers 17 days ago

CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning

Paper • 2511.18659 • Published Nov 24 • 18

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23 • 278

upvoted a collection 17 days ago

rnj-1

Collection • 5 items • Updated 10 days ago • 39

upvoted a paper 17 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 28 days ago • 93

upvoted a paper 18 days ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published 21 days ago • 74

upvoted an article 18 days ago

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Article • 20 days ago

upvoted an article 25 days ago

We Got Claude to Fine-Tune an Open Source LLM

Article • 26 days ago • 544

upvoted a collection 27 days ago

BERT-Chat

Collection • BERTs that chat • 2 items • Updated Nov 28 • 12

upvoted a paper about 1 month ago

NVIDIA Nemotron Parse 1.1

Paper • 2511.20478 • Published Nov 25 • 21

upvoted a collection about 1 month ago

Tarka Embed V1

Collection • Efficient DFKD embeddings for language understanding • 5 items • Updated 12 days ago • 6

upvoted an article about 1 month ago

Continuous batching from first principles

Article • Nov 25 • 288
