Malasar ASR Resources
Collection
7 items • Updated • 1
How to use vrclc/Malasar_small_MTF with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="vrclc/Malasar_small_MTF") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("vrclc/Malasar_small_MTF")
model = AutoModelForSpeechSeq2Seq.from_pretrained("vrclc/Malasar_small_MTF")This model is a fine-tuned version of vasista22/whisper-tamil-small on the Spoken Bible Corpus: Malasar dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.0924 | 11.3636 | 250 | 0.2929 | 52.1416 |
| 0.0013 | 22.7273 | 500 | 0.4152 | 52.8270 |
| 0.0008 | 34.0909 | 750 | 0.4400 | 52.8841 |
| 0.0002 | 45.4545 | 1000 | 0.4761 | 52.1416 |
| 0.0001 | 56.8182 | 1250 | 0.4888 | 51.6276 |
| 0.0001 | 68.1818 | 1500 | 0.4970 | 51.9703 |
| 0.0001 | 79.5455 | 1750 | 0.5021 | 51.7990 |
| 0.0001 | 90.9091 | 2000 | 0.5039 | 51.9132 |
@misc{multistage2024,
title={Multistage Fine-tuning Strategies for Automatic Speech Recognition in Low-resource Languages},
author={Leena G Pillai, Kavya Manohar, Basil K Raju, Elizabeth Sherly},
year={2024},
eprint={2411.04573},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2411.04573},
}
Base model
vasista22/whisper-tamil-small