coqui-ai / snakepit
Coqui's machine learning job scheduler
☆32 · Updated 3 years ago
Alternatives and similar repositories for snakepit:
Users interested in snakepit are comparing it to the repositories listed below:
- 🐸TTS recipes for different datasets ☆85 · Updated 2 years ago
- Coqui Inference Engine ☆38 · Updated 3 years ago
- ☆74 · Updated 3 years ago
- Automatically align transcribed audio and generate a wav2letter training corpus ☆36 · Updated last year
- Forced Alignments for Common Voice ☆31 · Updated 4 years ago
- Simple text to phonemes converter for multiple languages ☆20 · Updated 2 years ago
- A Hackable speech recognition library. ☆25 · Updated 4 months ago
- ☆43 · Updated 8 months ago
- TTS Client for Coqui TTS server ☆13 · Updated 2 years ago
- ☆56 · Updated 2 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ☆35 · Updated 2 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model ☆107 · Updated 3 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice ☆10 · Updated 4 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi. ☆26 · Updated 7 months ago
- 🎹 pyannote + notebook = pyannotebook ☆26 · Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆44 · Updated 3 years ago
- Tunable pipelines ☆31 · Updated 2 weeks ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 · Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆101 · Updated last year
- ☆80 · Updated 9 months ago
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c… ☆43 · Updated 2 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆64 · Updated 4 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone … ☆41 · Updated 2 years ago
- An even smaller speech recognizer / force aligner ☆32 · Updated 2 months ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone ☆35 · Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Updated 3 years ago
- Code for AccentDB. ☆20 · Updated 3 years ago
- Web app for keyword spotting using TensorflowJS ☆70 · Updated 2 years ago
- Use your data to create a speech recognition system in Kaldi. Fast. ☆65 · Updated 5 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆93 · Updated 5 months ago