Sukmin Cho

zomss

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper about 12 hours ago

Adaptive Multi-Agent Response Refinement in Conversational Systems

Paper • 2511.08319 • Published 1 day ago • 31

upvoted a paper 1 day ago

LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs

Paper • 2511.06174 • Published 4 days ago • 2

upvoted a paper 30 days ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10 • 79

liked a model 30 days ago

KORMo-Team/KORMo-10B-sft

Text Generation • 11B • Updated 9 days ago • 3.27k • 115

upvoted 2 papers about 1 month ago

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Paper • 2510.07499 • Published Oct 8 • 48

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3 • 96

upvoted a paper about 2 months ago

EpiCache: Episodic KV Cache Management for Long Conversational Question Answering

Paper • 2509.17396 • Published Sep 22 • 19

upvoted a paper 3 months ago

DINOv3

Paper • 2508.10104 • Published Aug 13 • 276

upvoted a paper 6 months ago

System Prompt Optimization with Meta-Learning

Paper • 2505.09666 • Published May 14 • 71

upvoted a paper 7 months ago

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24 • 120

upvoted a paper 9 months ago

Autellix: An Efficient Serving Engine for LLM Agents as General Programs

Paper • 2502.13965 • Published Feb 19 • 19

authored a paper 9 months ago

Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding

Paper • 2502.05609 • Published Feb 8 • 19

upvoted a paper 9 months ago

Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding

Paper • 2502.05609 • Published Feb 8 • 19

commented on a paper 9 months ago

Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding

Paper • 2502.05609 • Published Feb 8 • 19

upvoted a paper 9 months ago

Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations

Paper • 2404.13948 • Published Apr 22, 2024 • 2

authored 5 papers 9 months ago

Test-Time Self-Adaptive Small Language Models for Question Answering

Paper • 2310.13307 • Published Oct 20, 2023

Discrete Prompt Optimization via Constrained Generation for Zero-shot Re-ranker

Paper • 2305.13729 • Published May 23, 2023 • 1

Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question Answering

Paper • 2310.17490 • Published Oct 26, 2023

Augmenting Document Representations for Dense Retrieval with Interpolation and Perturbation

Paper • 2203.07735 • Published Mar 15, 2022

Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity

Paper • 2403.14403 • Published Mar 21, 2024 • 7
