Automatic Speech Recognition
Transformers
PyTorch
Safetensors
Chinese
Yue Chinese
whisper
whisper-event
Generated from Trainer
Eval Results (legacy)
Instructions to use alvanlii/whisper-small-cantonese with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
  - Transformers

How to use alvanlii/whisper-small-cantonese with Transformers:

    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("automatic-speech-recognition", model="alvanlii/whisper-small-cantonese")

    # Load model directly
    from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq

    processor = AutoProcessor.from_pretrained("alvanlii/whisper-small-cantonese")
    model = AutoModelForSpeechSeq2Seq.from_pretrained("alvanlii/whisper-small-cantonese")

- Notebooks
  - Google Colab
  - Kaggle
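The Transformers snippet above stops once the pipeline is built. As a minimal sketch of the next step, the helper below runs one audio file through it and returns the text. The names `build_asr_pipeline` and `transcribe` are hypothetical and not part of the model card. The sketch assumes the standard behaviour of the Transformers ASR pipeline: it accepts a path to an audio file and returns a dict with a "text" key.

```python
from typing import Callable, Optional

MODEL_ID = "alvanlii/whisper-small-cantonese"


def build_asr_pipeline(model_id: str = MODEL_ID):
    """Create the speech-recognition pipeline (downloads the model on first use)."""
    # Imported lazily so transcribe() can be exercised with a stand-in callable
    # on machines where transformers is not installed.
    from transformers import pipeline

    return pipeline("automatic-speech-recognition", model=model_id)


def transcribe(audio_path: str, asr: Optional[Callable] = None) -> str:
    """Return the transcript for a single audio file.

    `asr` is any callable that maps an audio path to a dict with a "text" key
    (assumption: this matches what the Transformers ASR pipeline returns).
    """
    if asr is None:
        asr = build_asr_pipeline()
    result = asr(audio_path)
    # Strip the surrounding whitespace that decoded transcripts may carry.
    return result["text"].strip()
```

Calling `transcribe("sample.wav")` with a local recording (the file name is only a placeholder) should return the transcript as a string. To transcribe many files, build the pipeline once with `build_asr_pipeline()` and pass it in as `asr`, so the model is not reloaded on every call.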
about Download link for dataset
#5
by lq0104 - opened
Hello, thank you for your work. Could you provide download links for the following training datasets? I would like to fine-tune other models on them:
Cantonese-ASR 72
CantoMap 23
Pseudo-Labelled YouTube Data 438
Thank you very much!
https://github.com/HLTCHKUST/cantonese-asr
https://github.com/gwinterstein/CantoMap
The last one I am still in the process of cleaning. I will release it once it is ready.
Thank you, excited to see how it goes! :))
alvanlii changed discussion status to closed