metadata
library_name: transformers
license: apache-2.0
datasets:
- mozilla-foundation/common_voice_17_0
language:
- bn
metrics:
- wer
base_model:
- banglabridge/base-bn-lora-adapter
model-index:
- name: Whisper Base Bn - BanglaBridge
results:
- task:
name: Automatic Speech Recognition
type: automatic-speech-recognition
dataset:
name: Common Voice 17.0
type: mozilla-foundation/common_voice_17_0
config: bn
split: None
args: 'config: bn, split: test'
metrics:
- name: Wer
type: wer
value: 22.56397
Whisper Base Bn - by BanglaBridge
This model is a fine-tuned version of openai/whisper-base on the Common Voice 17.0 dataset.
It is the merged model from this fine-tuned PEFT LoRA adapter: banglabridge/base-bn-lora-adapter
It achieves the following results on the test set:
- Wer: 44.93734
- Normalized Wer: 22.56397
Refer to the adapter repository for more details on the finetuning: banglabridge/base-bn-lora-adapter
Framework versions
- Transformers 4.40.2
- Pytorch 2.6.0+cu124
- Tokenizers 0.19.1
- Peft 0.10.0