gauthamsuresh09 / wav2vec2-large-xlsr-53-malayalamLinks
Wav2vec2 Large XLSR 53 fine-tuned for Malayalam
β11Updated 4 years ago
Alternatives and similar repositories for wav2vec2-large-xlsr-53-malayalam
Users that are interested in wav2vec2-large-xlsr-53-malayalam are comparing it to the libraries listed below
Sorting:
- π― Speech Recognition Challenge by Speech Lab - IIT Madrasβ11Updated 4 years ago
- β43Updated 2 years ago
- Text to Speech for Indic languagesβ51Updated 3 years ago
- A pipeline to isolate and transcribe one language in mixed-language speechβ19Updated 2 years ago
- Transcribe your videos and translate it into Indic languages.β31Updated last week
- NPTEL2020: Speech2Text dataset for Indian-English Accentβ77Updated 3 years ago
- Zero-Shot Foreign Accent Conversion without a Native Referenceβ34Updated last year
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://β¦β12Updated 3 years ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using largeβ¦β30Updated last year
- Indic-Conformer models for ASRβ18Updated last year
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2β92Updated last month
- The project is related to the development of labs for the ITMO Speaker Recognition Course.β10Updated 4 months ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.β17Updated 2 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at Iβ¦β18Updated 2 years ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASRβ62Updated 3 months ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assβ¦β31Updated last year
- Workflow for forced alignment between languagesβ20Updated last year
- A complete end-to-end Deep Learning system to generate high quality human like speech in English for Korean Drama (WIP)β13Updated 3 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using aβ¦β12Updated 2 years ago
- scipts for working with open.bible dataβ25Updated 3 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Spβ¦β13Updated 2 years ago
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.β10Updated 2 years ago
- A python package for whisper normalizerβ66Updated 2 weeks ago
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"β26Updated 2 years ago
- Generated Audio Samples by ALGAN-VC model are available in the folderβ19Updated 3 years ago
- The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"β13Updated 2 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024β12Updated last year
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.β87Updated 3 years ago
- This project is about performing Speaker diarization for Hindi Language.β50Updated 4 years ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023β54Updated 2 years ago