BSRoformer-GGUF

Official GGUF model repository for the BSRoformer.cpp project. This repository contains BS Roformer/Mel-Band-Roformer models converted to the GGUF format, enabling high-performance, cross-platform local inference using the MelBandRoformer.cpp inference engine.

📦 Model List

This repository contains GGUF quantized versions of the following original models:

Original Model Original Author
mel-band-roformer-deux becruily
MelBandRoformers (voc_fv6) GaboxR67

Quantization Types

To meet different hardware requirements, various quantized versions are provided:

  • q8_0 (Recommended): 8-bit quantization. Significantly reduces memory usage and bandwidth requirements while maintaining audio quality almost identical to FP32.
  • fp16: 16-bit floating point. Suitable for scenarios requiring maximum precision.
  • q4_0 / q4_1 / q5_0 / q5_1: Lower-bit quantization, suitable for devices with low VRAM/RAM. (Note: Norm and Bias weights remain in FP32 to ensure numerical stability)

🚀 Usage

Please use with the MelBandRoformer.cpp command-line tool.

1. Download Tool

Download the executable for your system from MelBandRoformer.cpp Releases (or compile it yourself).

2. Download Model

Download the desired .gguf file (e.g., voc_fv6-Q8_0.gguf) from the Files and versions page of this repository.

3. Run Inference

# Basic usage
./bs_roformer-cli model_q8_0.gguf input.wav output.wav
# Advanced usage (adjust chunk size and overlap to optimize quality)
./bs_roformer-cli model_q8_0.gguf input.wav output.wav --overlap 2
Downloads last month
297
GGUF
Model size
51.1M params
Architecture
bs_roformer
Hardware compatibility
Log In to add your hardware

4-bit

5-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support