Jack Lanchantin's picture

1 3 1

Jack Lanchantin

jcklcn

·

AI & ML interests

None yet

Recent Activity

authored a paper 13 days ago

Bridging Offline and Online Reinforcement Learning for LLMs

authored a paper 13 days ago

CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks

authored a paper 13 days ago

OptimalThinkingBench: Evaluating Over and Underthinking in LLMs

View all activity

Organizations

authored 4 papers 13 days ago

Bridging Offline and Online Reinforcement Learning for LLMs

Paper • 2506.21495 • Published Jun 26 • 3

CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks

Paper • 2507.23751 • Published Jul 31 • 4

OptimalThinkingBench: Evaluating Over and Underthinking in LLMs

Paper • 2508.13141 • Published Aug 18

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published 15 days ago • 13

authored a paper 2 months ago

Jointly Reinforcing Diversity and Quality in Language Model Generations

Paper • 2509.02534 • Published Sep 2 • 24

authored a paper 9 months ago

LLM Pretraining with Continuous Concepts

Paper • 2502.08524 • Published Feb 12 • 29

authored 4 papers 12 months ago

Learning to Reason and Memorize with Self-Notes

Paper • 2305.00833 • Published May 1, 2023 • 5

A Data Source for Reasoning Embodied Agents

Paper • 2309.07974 • Published Sep 14, 2023 • 7

TOOLVERIFIER: Generalization to New Tools via Self-Verification

Paper • 2402.14158 • Published Feb 21, 2024

Adaptive Decoding via Latent Preference Optimization

Paper • 2411.09661 • Published Nov 14, 2024 • 10