Open to Work

Tek Raj Awasthi

Tekraj15

AI & ML interests

Advanced Deep Learning | Multimodal Gen AI(LLM/VLM Compression, Fine-tuning, Post-training, RAG, Agents) | High Performance/Edge Computation; GPU sounds sexy, but "Edge Optimization" gives me real orgasm!

Recent Activity

liked a Space 5 days ago

r3gm/wan2-2-fp8da-aoti-preview

upvoted an article 6 days ago

SDXL in 4 steps with Latent Consistency LoRAs

liked a Space 8 days ago

Tekraj15/Sketch-to-Render-AI

View all activity

Organizations

upvoted an article 6 days ago

Article

SDXL in 4 steps with Latent Consistency LoRAs

Nov 9, 2023

•

upvoted a paper 29 days ago

Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 34

upvoted a paper 30 days ago

SimpleMem: Efficient Lifelong Memory for LLM Agents

Paper • 2601.02553 • Published Jan 5 • 36

upvoted 2 articles 6 months ago

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Aug 18, 2025

•

Article

Code a simple RAG from scratch

Oct 29, 2024

•

293

Tek Raj Awasthi

AI & ML interests

Recent Activity

Organizations

Tekraj15's activity

SDXL in 4 steps with Latent Consistency LoRAs

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Code a simple RAG from scratch