ReadAlongs / SoundSwallower
An even smaller speech recognizer / force aligner
☆34Updated 6 months ago
Alternatives and similar repositories for SoundSwallower
Users that are interested in SoundSwallower are comparing it to the libraries listed below
- SEPIA server to support open-source speech recognition via WebSocket connection.☆128Updated 8 months ago
- Coqui Inference Engine☆40Updated 3 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆37Updated this week
- IPA Phonemizer/Dephonemizer for 139 human languages☆30Updated this week
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- Launch your speech synthesis within one minute.☆12Updated last year
- On-device noise suppression powered by deep learning☆73Updated 3 weeks ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- Labeled data for homograph disambiguation☆59Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆219Updated this week
- Simple diarization model☆50Updated last month
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- A High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆100Updated 9 months ago
- Putting flows on top of neural transducers for better TTS☆62Updated 3 weeks ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated 2 years ago
- ☆22Updated 4 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆122Updated 7 months ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆44Updated 4 years ago
- Evaluation of STT models for the German language☆15Updated 3 years ago
- Web app for keyword spotting using TensorflowJS☆72Updated 2 years ago
- A curated list of awesome voice activity detection☆59Updated 7 months ago
- Audiobook alignment for Indigenous languages☆40Updated this week
- StyleTTS2 + Vocos as a Decoder☆13Updated 3 months ago
- OpenAI Whisper Prompt Examples☆52Updated last year
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆29Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated 3 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆29Updated 2 years ago