Debrup-61/merged_RaDeR_retriever_Mixtral_MATH_qCoT_and_LLMquery_lexical Feature Extraction • 7B • Updated Jun 28 • 9
Debrup-61/merged_RaDeR_Qwen2.5-7B-instruct_MATH_onlylexical_final Feature Extraction • 7B • Updated Jun 27 • 9
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper • 2503.09516 • Published Mar 12 • 36
IR4MATH2/merged_retriever_Qwen25-7B-instruct_MATH_allquerytypes_reward1 Feature Extraction • 7B • Updated Jun 17 • 6
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning Paper • 2402.17231 • Published Feb 27, 2024 • 3
Raderspace/RaDeR_Qwen25_3B_NuminaMath_MATH_allquerytypes Feature Extraction • 3B • Updated Jun 12 • 9 • 2