MoE Open source MoE IEITYuan/Yuan2-M32-hf Text Generation • Updated May 30, 2024 • 309 • 61 allenai/OLMoE-1B-7B-0924 Text Generation • 7B • Updated Oct 19, 2024 • 16.5k • 137 microsoft/Phi-3.5-MoE-instruct Text Generation • 42B • Updated Mar 7 • 107k • 564 Qwen/Qwen1.5-MoE-A2.7B Text Generation • 14B • Updated Apr 18, 2024 • 45.5k • 211
LGViT The checkpoints of LGViT. Paper link: https://arxiv.org/abs/2308.00255 LGViT: Dynamic Early Exiting for Accelerating Vision Transformer Paper • 2308.00255 • Published Aug 1, 2023 FALcon6/LGViT-ViT-Cifar100 Image Classification • Updated Oct 30, 2024 • 1 FALcon6/LGViT-DeiT-Cifar100 Image Classification • Updated Oct 30, 2024 • 1 FALcon6/LGViT-Swin-Cifar100 Image Classification • Updated Oct 30, 2024 • 2
LGViT: Dynamic Early Exiting for Accelerating Vision Transformer Paper • 2308.00255 • Published Aug 1, 2023
MoE Open source MoE IEITYuan/Yuan2-M32-hf Text Generation • Updated May 30, 2024 • 309 • 61 allenai/OLMoE-1B-7B-0924 Text Generation • 7B • Updated Oct 19, 2024 • 16.5k • 137 microsoft/Phi-3.5-MoE-instruct Text Generation • 42B • Updated Mar 7 • 107k • 564 Qwen/Qwen1.5-MoE-A2.7B Text Generation • 14B • Updated Apr 18, 2024 • 45.5k • 211
LGViT The checkpoints of LGViT. Paper link: https://arxiv.org/abs/2308.00255 LGViT: Dynamic Early Exiting for Accelerating Vision Transformer Paper • 2308.00255 • Published Aug 1, 2023 FALcon6/LGViT-ViT-Cifar100 Image Classification • Updated Oct 30, 2024 • 1 FALcon6/LGViT-DeiT-Cifar100 Image Classification • Updated Oct 30, 2024 • 1 FALcon6/LGViT-Swin-Cifar100 Image Classification • Updated Oct 30, 2024 • 2
LGViT: Dynamic Early Exiting for Accelerating Vision Transformer Paper • 2308.00255 • Published Aug 1, 2023