AI4Bharat / RasaLinks
Expressive TTS Dataset for Assamese, Bengali, and Tamil.
☆11Updated 6 months ago
Alternatives and similar repositories for Rasa
Users that are interested in Rasa are comparing it to the libraries listed below
Sorting:
- Finetune VITS and MMS using HuggingFace's tools☆162Updated last year
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆61Updated 2 months ago
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆152Updated 2 years ago
- Update ASR paper everyday☆303Updated this week
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆89Updated last week
- ☆377Updated last year
- Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.☆81Updated 2 months ago
- Finetune Wa2vec 2.0 For Speech Recognition☆138Updated 7 months ago
- Versatile Evaluation of Speech and Audio☆319Updated 2 weeks ago
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".☆452Updated last year
- Variational Bayes HMM over x-vectors diarization☆275Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆332Updated 2 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆87Updated last year
- Text to speech alignment using CTC forced alignment☆347Updated 3 weeks ago
- A collection of dataset consists of a total of 8 English speech datasets for SER☆28Updated 7 months ago
- The official repository of Dynamic-SUPERB.☆189Updated 2 months ago
- Multilingual G2P in 100 languages☆353Updated 2 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆429Updated 3 weeks ago
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆124Updated last year
- UT-Sarulab MOS prediction system using SSL models☆258Updated last year
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆260Updated 7 months ago
- A unified dataset of multilingual emotional human utterances☆28Updated 3 years ago
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆121Updated last year
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023☆229Updated 3 months ago
- A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.☆60Updated 2 weeks ago
- UTokyo-SaruLab MOS Prediction System☆232Updated last month
- Some comprehensive papers about speaker diarization☆302Updated 3 months ago
- Synthetic Dialog Generation and Analysis with LLMs☆33Updated last week
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆91Updated 5 months ago
- ☆30Updated 3 years ago