JinnP/qwen3-8b-kernelbook-sft-megatron
Tags: megatron, qwen, sft, checkpoint, kernelbook
License: apache-2.0
Branch: main
Total size: 115 GB
1 contributor
History: 2 commits
Latest commit: 8ef8d49 (verified) by JinnP, 3 months ago: Upload Qwen3-8B-KernelBook-SFT model
Every file below was last changed by the commit "Upload Qwen3-8B-KernelBook-SFT model", 3 months ago.

File            Size       Flags
.gitattributes  2.35 kB    Safe
.metadata       1.89 MB    xet
README.md       2.21 kB    -
__0_0.distcp    7.17 GB    xet
__0_1.distcp    7.17 GB    xet
__1_0.distcp    7.17 GB    xet
__1_1.distcp    7.17 GB    xet
__2_0.distcp    7.17 GB    xet
__2_1.distcp    7.17 GB    xet
__3_0.distcp    7.17 GB    xet
__3_1.distcp    7.17 GB    xet
__4_0.distcp    7.17 GB    xet
__4_1.distcp    7.17 GB    xet
__5_0.distcp    7.17 GB    xet
__5_1.distcp    7.17 GB    xet
__6_0.distcp    7.17 GB    xet
__6_1.distcp    7.17 GB    xet
__7_0.distcp    7.17 GB    xet
__7_1.distcp    7.17 GB    xet
common.pt       29.5 kB    pickle, xet
metadata.json   119 Bytes  Safe

common.pt: Detected Pickle imports (5): "megatron.core.enums.ModelType", "torch.bfloat16", "megatron.core.transformer.enums.AttnBackend", "torch.float32", "argparse.Namespace"
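
The listing flags common.pt as a pickle file that imports five globals, including argparse.Namespace and two Megatron enums. Because unpickling can run arbitrary code, it can be useful to list a pickle's imports before loading it. The following is a minimal sketch of one way to do that with Python's standard pickletools module; it is not necessarily how the Hub's scanner works. The function names are hypothetical, the STACK_GLOBAL handling is a heuristic (it assumes the two most recent string constants are the module and the name), and list_torch_file_globals assumes the file is a zip-format torch.save archive containing a data.pkl entry, which the listing does not confirm for common.pt.

```python
import argparse
import pickle
import pickletools
import zipfile


def list_pickle_globals(data: bytes) -> set:
    """Return 'module.name' for each global a pickle stream references.

    The stream is only disassembled, never unpickled, so no code in it runs.
    """
    found = set()
    strings = []  # string constants seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name" separated by a space.
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name were pushed as strings beforehand.
            # Heuristic: take the two most recent string constants.
            if len(strings) >= 2:
                found.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return found


def list_torch_file_globals(path) -> set:
    """Scan every data.pkl entry of a zip archive (assumed torch.save layout)."""
    found = set()
    with zipfile.ZipFile(path) as archive:
        for name in archive.namelist():
            if name.endswith("data.pkl"):
                found |= list_pickle_globals(archive.read(name))
    return found


# Small self-contained demonstration on an in-memory pickle.
demo = pickle.dumps(argparse.Namespace(example=1), protocol=2)
print(sorted(list_pickle_globals(demo)))
```

On a local copy of the file, list_torch_file_globals("common.pt") would be the corresponding call, under the archive-layout assumption stated above.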