gauthamsuresh09 / wav2vec2-large-xlsr-53-malayalam
Wav2vec2 Large XLSR 53 fine-tuned for Malayalam
☆11, updated 4 years ago
Alternatives and similar repositories for wav2vec2-large-xlsr-53-malayalam
Users who are interested in wav2vec2-large-xlsr-53-malayalam are comparing it to the repositories listed below.
- Speech Recognition Challenge by Speech Lab - IIT Madras (☆11, updated 5 years ago)
- Text to Speech for Indic languages (☆52, updated 3 years ago)
- (☆45, updated 3 years ago)
- Indic-Conformer models for ASR (☆20, updated last year)
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages (☆13, updated 3 years ago)
- Transcribe your videos and translate them into Indic languages. (☆31, updated last month)
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp… (☆15, updated 2 years ago)
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR (☆70, updated 7 months ago)
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://… (☆14, updated 3 years ago)
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2 (☆101, updated 4 months ago)
- NPTEL2020: Speech2Text dataset for Indian-English Accent (☆80, updated 4 years ago)
- A pipeline to isolate and transcribe one language in mixed-language speech (☆19, updated 3 years ago)
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a… (☆12, updated 2 years ago)
- The project is related to the development of labs for the ITMO Speaker Recognition Course. (☆15, updated last month)
- Zero-Shot Foreign Accent Conversion without a Native Reference (☆36, updated last year)
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model (☆34, updated 2 years ago)
- Deep Learning model for lexical stress detection in spoken English (☆29, updated 5 years ago)
- Convert Arabic diacritised text to a sequence of phonemes and create a pronunciation dictionary from them for alignment using HTK (☆63, updated 8 years ago)
- Workflow for forced alignment between languages (☆23, updated last year)
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023 (☆55, updated 2 years ago)
- Hosts text-to-speech corpus and speech synthesizers for African languages. (☆17, updated 2 years ago)
- A complete end-to-end Deep Learning system to generate high-quality, human-like speech in English for Korean Drama (WIP) (☆13, updated 3 years ago)
- Official PyTorch implementation of TTS Style Transfer (☆25, updated 3 years ago)
- (☆14, updated 2 years ago)
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024 (☆14, updated last year)
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large… (☆43, updated last year)
- Generated Audio Samples by ALGAN-VC model are available in the folder (☆19, updated 3 years ago)
- Repository containing an experimentation platform for training and running inference on wav2vec2 models. (☆87, updated 3 years ago)
- The official code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation" (☆13, updated 2 years ago)
- Automatic parallel speech database extractor from dubbed movies (☆26, updated last year)