-
KAN or MLP: A Fairer Comparison
Paper • 2407.16674 • Published • 43 -
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper • 2403.02884 • Published • 17 -
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
Paper • 2402.13064 • Published • 50 -
DPTDR: Deep Prompt Tuning for Dense Passage Retrieval
Paper • 2208.11503 • Published
Collections
Discover the best community collections!
Collections including paper arxiv:2403.02884
-
introspector/unimath
Updated • 900 • 7 -
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms
Paper • 1905.13319 • Published • 2 -
Measuring Mathematical Problem Solving With the MATH Dataset
Paper • 2103.03874 • Published • 5 -
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper • 2403.02884 • Published • 17
-
How to Train Data-Efficient LLMs
Paper • 2402.09668 • Published • 42 -
Adapting Large Language Models via Reading Comprehension
Paper • 2309.09530 • Published • 81 -
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper • 2403.03507 • Published • 189 -
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper • 2403.02884 • Published • 17
-
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 44 -
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems
Paper • 2311.11315 • Published • 8 -
Alignment for Honesty
Paper • 2312.07000 • Published • 16 -
Steering Llama 2 via Contrastive Activation Addition
Paper • 2312.06681 • Published • 15
-
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper • 2403.02884 • Published • 17 -
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 131 -
Improving Small Language Models' Mathematical Reasoning via Mix Thoughts Distillation
Paper • 2401.11864 • Published • 2 -
Common 7B Language Models Already Possess Strong Math Capabilities
Paper • 2403.04706 • Published • 20
-
Orca-Math: Unlocking the potential of SLMs in Grade School Math
Paper • 2402.14830 • Published • 25 -
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper • 2403.02884 • Published • 17 -
meta-math/MetaMath-Mistral-7B
Text Generation • Updated • 1.65k • 96 -
meta-math/MetaMath-13B-V1.0
Text Generation • Updated • 757 • 13
-
Rethinking Optimization and Architecture for Tiny Language Models
Paper • 2402.02791 • Published • 13 -
More Agents Is All You Need
Paper • 2402.05120 • Published • 57 -
Scaling Laws for Forgetting When Fine-Tuning Large Language Models
Paper • 2401.05605 • Published -
Aligning Large Language Models with Counterfactual DPO
Paper • 2401.09566 • Published • 2
-
KAN or MLP: A Fairer Comparison
Paper • 2407.16674 • Published • 43 -
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper • 2403.02884 • Published • 17 -
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
Paper • 2402.13064 • Published • 50 -
DPTDR: Deep Prompt Tuning for Dense Passage Retrieval
Paper • 2208.11503 • Published
-
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper • 2403.02884 • Published • 17 -
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 131 -
Improving Small Language Models' Mathematical Reasoning via Mix Thoughts Distillation
Paper • 2401.11864 • Published • 2 -
Common 7B Language Models Already Possess Strong Math Capabilities
Paper • 2403.04706 • Published • 20
-
introspector/unimath
Updated • 900 • 7 -
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms
Paper • 1905.13319 • Published • 2 -
Measuring Mathematical Problem Solving With the MATH Dataset
Paper • 2103.03874 • Published • 5 -
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper • 2403.02884 • Published • 17
-
Orca-Math: Unlocking the potential of SLMs in Grade School Math
Paper • 2402.14830 • Published • 25 -
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper • 2403.02884 • Published • 17 -
meta-math/MetaMath-Mistral-7B
Text Generation • Updated • 1.65k • 96 -
meta-math/MetaMath-13B-V1.0
Text Generation • Updated • 757 • 13
-
How to Train Data-Efficient LLMs
Paper • 2402.09668 • Published • 42 -
Adapting Large Language Models via Reading Comprehension
Paper • 2309.09530 • Published • 81 -
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper • 2403.03507 • Published • 189 -
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper • 2403.02884 • Published • 17
-
Rethinking Optimization and Architecture for Tiny Language Models
Paper • 2402.02791 • Published • 13 -
More Agents Is All You Need
Paper • 2402.05120 • Published • 57 -
Scaling Laws for Forgetting When Fine-Tuning Large Language Models
Paper • 2401.05605 • Published -
Aligning Large Language Models with Counterfactual DPO
Paper • 2401.09566 • Published • 2
-
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 44 -
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems
Paper • 2311.11315 • Published • 8 -
Alignment for Honesty
Paper • 2312.07000 • Published • 16 -
Steering Llama 2 via Contrastive Activation Addition
Paper • 2312.06681 • Published • 15