AI4Bharat / SvarahLinks
Swarah: Indian-English speech dataset collected across the country
☆34Updated 2 weeks ago
Alternatives and similar repositories for Svarah
Users that are interested in Svarah are comparing it to the libraries listed below
Sorting:
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆77Updated 3 years ago
- asr2k☆51Updated last year
- ☆17Updated 4 years ago
- A python package for whisper normalizer☆63Updated last month
- ☆46Updated 2 years ago
- Code for AccentDB.☆22Updated 4 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- ☆12Updated 5 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- A handy dataset of noises for ASR☆21Updated 6 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- Grapheme to phoneme model for PyTorch☆41Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- ☆17Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- ☆37Updated 2 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Updated 9 months ago
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆52Updated last month
- ☆56Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆54Updated 2 years ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- ☆11Updated 3 years ago
- A list of podcast URLs scraped from the Apple podcast database in late 2021, including a script for downloading those podcasts.☆41Updated 3 years ago
- Dataset release for Emotional TTS in Indian Accent☆39Updated 2 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆52Updated 11 months ago
- Dataset Release for Intent Classification from Speech☆47Updated 4 months ago
- Pronunciation-assisted Subword Modeling☆29Updated 6 years ago