coqui-ai / snakepitLinks
π Coqui's machine learning job scheduler
β32Updated 3 years ago
Alternatives and similar repositories for snakepit
Users that are interested in snakepit are comparing it to the libraries listed below
Sorting:
- πΈTTS recipes for different datasetsβ86Updated 2 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.β128Updated 8 months ago
- Web app for keyword spotting using TensorflowJSβ72Updated 2 years ago
- β76Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ26Updated 2 years ago
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated 2 years ago
- Coqui Inference Engineβ40Updated 3 years ago
- πΉ pyannote + π notebook = pyannotebookβ26Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ26Updated 2 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 6 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ102Updated 5 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voiceβ10Updated 4 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Updated last year
- DeepSpeech based forced alignment toolβ238Updated 4 years ago
- Jupyter Notebooks for creating Speech datasetsβ46Updated 6 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.β25Updated 2 years ago
- Text to Speech for Indic languagesβ51Updated 3 years ago
- Advanced data structures for handling temporal segments with attached labels.β114Updated 5 months ago
- Evaluate results from ASR/Speech-to-Text quicklyβ37Updated 3 years ago
- Manage audio and video datasetsβ31Updated 2 weeks ago
- An even smaller speech recognizer / force alignerβ34Updated 6 months ago
- πΈSTT integration examplesβ129Updated 2 years ago
- β43Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ104Updated 5 months ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- β56Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β102Updated 2 years ago
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ254Updated last year