nm-testing/Meta-Llama-3-8B-Instruct-W8A8-FP8-Channelwise-compressed-tensors Text Generation • 8B • Updated Oct 9, 2024 • 2 • 1
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a16 Text Generation • 3B • Updated Oct 23, 2024 • 3.97k • 12
RedHatAI/whisper-large-v3-turbo-FP8-dynamic Automatic Speech Recognition • 0.9B • Updated Apr 22 • 235 • 6
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 Image-Text-to-Text • 402B • Updated May 22 • 168k • • 140