AI4Bharat / indic-asr-api-backendView external linksLinks
Indic-Conformer models for ASR
☆20Jul 19, 2024Updated last year
Alternatives and similar repositories for indic-asr-api-backend
Users that are interested in indic-asr-api-backend are comparing it to the libraries listed below
Sorting:
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Nov 5, 2020Updated 5 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Jun 30, 2023Updated 2 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 9 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆14Jun 28, 2024Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- ☆14Aug 19, 2024Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Feb 2, 2026Updated last week
- Python implementation of a few speech intelligibility prediction algorithms☆15May 29, 2024Updated last year
- ☆20Mar 4, 2024Updated last year
- Audio tokenization, in the fastest way possible!☆53Aug 26, 2024Updated last year
- RVC Onnx Infer- Upgraded and simplified-ish☆25May 9, 2024Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆27Aug 1, 2023Updated 2 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆24Feb 17, 2023Updated 2 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- This repository contains text-to-speech (TTS) models and utilities designed produce synthetic training datasets for other speech-related …☆28Mar 12, 2023Updated 2 years ago
- A unified model for zero-shot singing voice conversion and synthesis☆22Nov 30, 2022Updated 3 years ago
- This is a winter of code project aimed at speech enhancement of text to speech models.☆24Feb 6, 2022Updated 4 years ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆27May 25, 2023Updated 2 years ago
- Non Parallel Voice Conversion based on VITS☆24Mar 31, 2023Updated 2 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- ☆25Jun 14, 2022Updated 3 years ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆73Jun 8, 2025Updated 8 months ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25May 6, 2019Updated 6 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆11Jun 28, 2022Updated 3 years ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.☆25Oct 3, 2022Updated 3 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Feb 15, 2023Updated 2 years ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆38Jan 5, 2025Updated last year
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Aug 27, 2023Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36May 1, 2024Updated last year
- monkeyplug is a little script to mute profanity in audio files☆37Jan 27, 2026Updated 2 weeks ago
- Ready-to-use Multilingual Text-To-Speech (TTS) package.☆24Aug 13, 2023Updated 2 years ago
- An evaluation toolkit for voice conversion models.☆42Jul 11, 2021Updated 4 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Mar 14, 2023Updated 2 years ago
- ☆28Dec 14, 2021Updated 4 years ago