ReadAlongs / SoundSwallower
An even smaller speech recognizer / force aligner
☆32Updated last month
Alternatives and similar repositories for SoundSwallower:
Users that are interested in SoundSwallower are comparing it to the libraries listed below
- The EveryVoice TTS Toolkit - Text To Speech for your language☆24Updated this week
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- Audiobook alignment for Indigenous languages☆38Updated this week
- Evaluation of STT models for german language☆15Updated 3 years ago
- Coqui Inference Engine☆38Updated 3 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆146Updated last week
- Experiments to test different speech recognition systems for SEPIA Framework☆58Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- On-device speaker diarization powered by deep learning☆34Updated 2 weeks ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆19Updated 11 months ago
- Workflow for forced alignment between languages☆17Updated 11 months ago
- Simple Diarization model☆46Updated last year
- OCTRA is a web-application for the orthographic transcription of audio files.☆37Updated this week
- ☆17Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆90Updated 3 months ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆192Updated 2 weeks ago
- ☆9Updated 3 months ago
- Multilingual Grapheme to Phoneme☆49Updated 8 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆123Updated 2 months ago
- Prosodic Speech Segmentation with Transformers☆25Updated 11 months ago
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Labeled data for homograph disambiguation☆54Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆24Updated 2 weeks ago