Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
sumitdotml
/
moe-emergence
like
0
Text Generation
Transformers
Safetensors
codeparrot/codeparrot-clean
allenai/ai2_arc
allenai/c4
English
mixture-of-experts
gpt2
research
expert-specialization
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
moe-emergence
/
dense-baseline
2.15 GB
2 contributors
History:
1 commit
sumitdotml
add dense and moe checkpoints
3ff42e6
10 days ago
ckpt-step-4999.pt
pickle
Detected Pickle imports (8)
"torch.ByteStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"numpy.dtype"
,
"torch.FloatStorage"
,
"collections.OrderedDict"
,
"numpy.ndarray"
,
"numpy._core.multiarray._reconstruct"
,
"_codecs.encode"
How to fix it?
1.49 GB
xet
add dense and moe checkpoints
10 days ago
final-model.json
Safe
756 Bytes
add dense and moe checkpoints
10 days ago
final-model.safetensors
652 MB
xet
add dense and moe checkpoints
10 days ago
metrics.jsonl
Safe
1.6 MB
add dense and moe checkpoints
10 days ago