Coleman Hooper
chooper1
AI & ML interests
Efficient NLP
Recent Activity
upvoted a paper 27 days ago
ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs
upvoted a paper 3 months ago
XQuant: Breaking the Memory Wall for LLM Inference with KV Cache Rematerialization