TeichAI
/

Nemotron-Orchestrator-8B-Claude-4.5-Opus-Distill

Model card Files Files and versions

armand0e commited on 14 days ago

Commit

07e8298

·

verified ·

1 Parent(s): 99678a1

Update README.md

Files changed (1) hide show

README.md +26 -15

README.md CHANGED Viewed

@@ -1,21 +1,32 @@
 ---
-base_model: nvidia/Nemotron-Orchestrator-8B
-tags:
-- text-generation-inference
-- transformers
-- unsloth
-- qwen3
-license: apache-2.0
-language:
-- en
 ---
-# Uploaded finetuned  model
-- **Developed by:** TeichAI
-- **License:** apache-2.0
-- **Finetuned from model :** nvidia/Nemotron-Orchestrator-8B
-This qwen3 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 ---
+datasets:
+- TeichAI/claude-4.5-opus-high-reasoning-250x
+base_model:
+- unsloth/Qwen3-8B-unsloth-bnb-4bit
 ---
+# Nemotron Orchestrator 8B x Claude 4.5 Opus (High Reasoning) Distill
+This model was trained on a **Claude Opus 4.5 (reasoning)** dataset with a high reasoning effort.
+You are viewing the safetensors variant of this model, a quantized gguf variant is available here: [TeichAI/Nemotron-Orchestrator-8B-Claude-4.5-Opus-Distill-GGUF](https://huggingface.co/TeichAI/Nemotron-Orchestrator-8B-Claude-4.5-Opus-Distill-GGUF)
+- &#129302; Related Models:
+| Model      | Effective parameters      | Active parameters     |
+| ------------- | ------------- | ------------- |
+| [`Qwen3-8B-Claude-4.5-Opus-High-Reasoning-Distill`](https://huggingface.co/TeichAI/Qwen3-8B-Claude-4.5-Opus-High-Reasoning-Distill) | 8 B | 8 B |
+| [`Qwen3-4B-Thinking-2507-Claude-4.5-Opus-High-Reasoning-Distill`](https://huggingface.co/TeichAI/Qwen3-4B-Thinking-2507-Claude-4.5-Opus-High-Reasoning-Distill) | 4 B | 4 B |
+- 🧬 Datasets:
+  - `TeichAI/claude-4.5-opus-high-reasoning-250x`
+- 🏗 Base Model:
+  - `unsloth/Qwen3-8B-unsloth-bnb-4bit`
+- &#9889; Use cases:
+  - Coding
+  - Science
+  - General Purpose
+- &#8721; Stats (Dataset)
+  - Costs: $ 52.3 (USD)
+  - Total tokens (input + output): 2.13 M