vbnm2103
's Collections
To Read
updated
Writing in the Margins: Better Inference Pattern for Long Context
Retrieval
Paper
•
2408.14906
•
Published
•
144
Training Language Models to Self-Correct via Reinforcement Learning
Paper
•
2409.12917
•
Published
•
140
Towards a Unified View of Preference Learning for Large Language Models:
A Survey
Paper
•
2409.02795
•
Published
•
72
Attention Heads of Large Language Models: A Survey
Paper
•
2409.03752
•
Published
•
92
Building and better understanding vision-language models: insights and
future directions
Paper
•
2408.12637
•
Published
•
133
Transformer Explainer: Interactive Learning of Text-Generative Models
Paper
•
2408.04619
•
Published
•
172
Gemma 2: Improving Open Language Models at a Practical Size
Paper
•
2408.00118
•
Published
•
79
Why Does the Effective Context Length of LLMs Fall Short?
Paper
•
2410.18745
•
Published
•
18
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Paper
•
2410.10814
•
Published
•
51
Toward General Instruction-Following Alignment for Retrieval-Augmented
Generation
Paper
•
2410.09584
•
Published
•
49
Can Knowledge Editing Really Correct Hallucinations?
Paper
•
2410.16251
•
Published
•
55
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A
Gradient Perspective
Paper
•
2410.23743
•
Published
•
63
Scaling Latent Reasoning via Looped Language Models
Paper
•
2510.25741
•
Published
•
211
DeepAgent: A General Reasoning Agent with Scalable Toolsets
Paper
•
2510.21618
•
Published
•
95
Reasoning with Sampling: Your Base Model is Smarter Than You Think
Paper
•
2510.14901
•
Published
•
47
Continual Learning via Sparse Memory Finetuning
Paper
•
2510.15103
•
Published
•
3