leran1995 committed
Commit ee9e9cd · verified · 1 Parent: 59c140c

Update README.md

Files changed (1):
  1. README.md (+16 -21)
README.md CHANGED
@@ -1,16 +1,16 @@
- ---
- license: apache-2.0
- language:
- - en
- - zh
- library_name: transformers
- pipeline_tag: text-generation
- tags:
- - llm
- - nanbeige
- base_model:
- - Nanbeige/Nanbeige4-3B-Base
- ---
+ ---
+ license: apache-2.0
+ language:
+ - en
+ - zh
+ library_name: transformers
+ pipeline_tag: text-generation
+ tags:
+ - llm
+ - nanbeige
+ base_model:
+ - Nanbeige/Nanbeige4-3B-Base
+ ---
  <div align="center">

  <img src="figures/nbg.png" width="220" alt="Nanbeige Logo">
@@ -24,22 +24,17 @@ base_model:

  # Introduction
  Nanbeige4-3B-Thinking-2511 is an enhanced iteration over our previous Nanbeige4-3B-Thinking-2510.
- Through advanced distillation techniques and reinforcement learning (RL) optimization, we have effectively scaled the model’s reasoning capacity, resulting in superior performance across a broad range of benchmarks.
- On math and science reasoning benchmarks, Nanbeige4-3B-Thinking-2511 outperforms Qwen3-4B-Thinking-2507, Qwen3-8B-Thinking-2504, and Qwen3-14B-Thinking-2504 with a significant margin.
- Besides, Nanbeige4-3B-Thinking-2511 achieves state-of-the-art (SOTA) results among models smaller than 32B parameters on general tasks like Arena-Hard-V2 and BFCL-V4.
- This marks a major milestone in delivering powerful, efficient reasoning performance at a compact scale.
+ Through advanced knowledge distillation techniques and targeted reinforcement learning (RL) optimization, we have significantly scaled the model’s reasoning capabilities, delivering stronger and more reliable performance across a diverse set of challenging benchmarks.
+ This version establishes new state-of-the-art (SOTA) results among open models under 32B parameters on AIME, GPQA-Diamond, Arena-Hard-V2, and BFCL-V4, marking a major milestone in delivering powerful yet efficient reasoning capabilities at a compact scale.

  * Technical Report: https://arxiv.org/pdf/2512.06266

  <div align="center">

- <img src="figures/performance_reasoning.png">
+ <img src="figures/nbg_performance.png">
  </div>

- <div align="center">

- <img src="figures/performance_2511.png">
- </div>
 
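For reference, here is a minimal generation sketch with transformers, matching the card's `library_name` and `pipeline_tag` metadata. The repository id `Nanbeige/Nanbeige4-3B-Thinking-2511` is an assumption inferred from the model name on this page, and the generation settings are illustrative; see the model card itself for the official usage snippet.

```python
# Minimal sketch: load the model with transformers and run one chat turn.
# The repo id below is an assumption inferred from the model name; check
# the model card for the official snippet and recommended settings.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Nanbeige/Nanbeige4-3B-Thinking-2511"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # use the checkpoint's native dtype
    device_map="auto",    # place weights on available GPU(s)/CPU
    trust_remote_code=True,
)

# Build a prompt with the model's chat template.
messages = [{"role": "user", "content": "What is 17 * 23? Explain briefly."}]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Thinking-style models emit a reasoning trace before the final answer,
# so leave generous room for new tokens.
outputs = model.generate(**inputs, max_new_tokens=2048)
print(
    tokenizer.decode(
        outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
)
```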