AI4Bharat / RasaLinks
Expressive TTS Dataset for Assamese, Bengali, and Tamil.
☆11Updated 7 months ago
Alternatives and similar repositories for Rasa
Users that are interested in Rasa are comparing it to the libraries listed below
Sorting:
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆98Updated 2 months ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆63Updated 4 months ago
- A unified dataset of multilingual emotional human utterances☆28Updated 3 years ago
- Finetune VITS and MMS using HuggingFace's tools☆170Updated last year
- This is the M-AILABS Speech Dataset☆87Updated 11 months ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆77Updated 3 years ago
- Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.☆88Updated 3 months ago
- Finetune Wa2vec 2.0 For Speech Recognition☆140Updated 8 months ago
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…☆14Updated last year
- Versatile Evaluation of Speech and Audio☆351Updated last week
- ☆378Updated last year
- Various speech datasets made available to the public☆131Updated 10 months ago
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆125Updated last year
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆176Updated 2 months ago
- ☆21Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆176Updated last year
- ☆30Updated 3 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆262Updated 9 months ago
- UT-Sarulab MOS prediction system using SSL models☆276Updated last year
- The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆179Updated last month
- Reference-aware automatic speech evaluation toolkit☆164Updated 10 months ago
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆125Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆346Updated 2 years ago
- A collection of dataset consists of a total of 8 English speech datasets for SER☆28Updated 9 months ago
- UTokyo-SaruLab MOS Prediction System☆255Updated 3 weeks ago
- Machine learning speaker characteristics☆41Updated 2 weeks ago
- Repository for Accent Recognition (Hackathon @SLT2022)☆36Updated last year
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆94Updated 7 months ago
- Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities☆144Updated 10 months ago
- CHiME-9 Task 1 - MCoRec baseline☆22Updated 4 months ago