Instructions to use speech31/XLS-R-tamil-phoneme with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use speech31/XLS-R-tamil-phoneme with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="speech31/XLS-R-tamil-phoneme")# Load model directly from transformers import AutoProcessor, AutoModelForCTC processor = AutoProcessor.from_pretrained("speech31/XLS-R-tamil-phoneme") model = AutoModelForCTC.from_pretrained("speech31/XLS-R-tamil-phoneme") - Notebooks
- Google Colab
- Kaggle
| datasets: | |
| - mozilla-foundation/common_voice_16_1 | |
| language: | |
| - ta | |
| metrics: | |
| - wer | |
| pipeline_tag: automatic-speech-recognition | |
| This model is fine-tuned on the Tamil dataset from Common Voice 16.1, preprocessed using Epitran for transliterating text into IPA. The 'tam-Taml' code was employed to generate a precise phoneme list, crucial for capturing the nuances of Tamil phonetics: | |
| * Vowels: | |
| * Monophthongs:'a', 'aː', 'e', 'eː', 'i', 'iː', 'o', 'oː', 'u', 'uː' | |
| * Diphthongs: 'aj', 'aʋ' | |
| * Consonants: | |
| * Nasals: 'm', 'n̪', 'n', 'ɳ', 'ɲ', 'ŋ' | |
| * Stops: 'p', 't̪', 'ʈ', 'k', | |
| * Affricates: 't͡ʃ', 'd͡ʒ' | |
| * Fricatives: 's', 'ʂ', 'ʃ', 'h' | |
| * Tap: 'ɾ' | |
| * Trill: 'r' | |
| * Approximants: 'ʋ','ɻ', 'j', 'l', 'ɭ' | |
| * Consonant cluster: 'kʂ' | |
| * Special Symbols: '்' (denotes the absence of inherent vowel) |