The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning Paper • 2506.01347 • Published Jun 2, 2025 • 3
ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models Paper • 2406.20015 • Published Jun 28, 2024 • 1
HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing Paper • 2406.11683 • Published Jun 17, 2024
Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective Paper • 2210.08590 • Published Oct 16, 2022
Solving Math Word Problems via Cooperative Reasoning induced Language Models Paper • 2210.16257 • Published Oct 28, 2022
Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence Paper • 2209.02970 • Published Sep 7, 2022
Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast Paper • 2405.14507 • Published May 23, 2024
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation Paper • 2406.09961 • Published Jun 14, 2024 • 55
Question Answering as Programming for Solving Time-Sensitive Questions Paper • 2305.14221 • Published May 23, 2023
AutoConv: Automatically Generating Information-seeking Conversations with Large Language Models Paper • 2308.06507 • Published Aug 12, 2023 • 1