gauthamsuresh09 / wav2vec2-large-xlsr-53-malayalamLinks
Wav2vec2 Large XLSR 53 fine-tuned for Malayalam
β11Updated 3 years ago
Alternatives and similar repositories for wav2vec2-large-xlsr-53-malayalam
Users that are interested in wav2vec2-large-xlsr-53-malayalam are comparing it to the libraries listed below
Sorting:
- π― Speech Recognition Challenge by Speech Lab - IIT Madrasβ11Updated 4 years ago
- β43Updated 2 years ago
- Text to Speech for Indic languagesβ51Updated 3 years ago
- A pipeline to isolate and transcribe one language in mixed-language speechβ19Updated 2 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course.β10Updated 3 months ago
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"β26Updated 2 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://β¦β12Updated 3 years ago
- Generated Audio Samples by ALGAN-VC model are available in the folderβ19Updated 3 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accentβ78Updated 3 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024β12Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languagesβ13Updated 2 years ago
- Deep Learning model for lexical stress detection in spoken Englishβ29Updated 5 years ago
- A simple voice conversion toolβ18Updated 3 years ago
- β28Updated 4 years ago
- Workflow for forced alignment between languagesβ20Updated last year
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using largeβ¦β29Updated last year
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at Iβ¦β17Updated 2 years ago
- A set of tools for working with accent data in Mozilla's Common Voice datasetβ13Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Referenceβ34Updated last year
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assβ¦β30Updated last year
- The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"β13Updated 2 years ago
- β20Updated last year
- A complete end-to-end Deep Learning system to generate high quality human like speech in English for Korean Drama (WIP)β13Updated 2 years ago
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).β20Updated last year
- Hosts text-to-speech corpus and speech synthesizers for African languages.β17Updated 2 years ago
- Code for AccentDB.β22Updated 4 years ago
- Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice qualityβ22Updated 6 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2β89Updated last year
- Official PyTorch implementation of TTS Style Transferβ24Updated 3 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Modelβ32Updated 2 years ago