carlfm01 / librivox-toolsLinks
Collector and speech cutter for librivox audiobooks
☆22Updated 2 years ago
Alternatives and similar repositories for librivox-tools
Users that are interested in librivox-tools are comparing it to the libraries listed below
Sorting:
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Audio Book scrapper☆26Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆26Updated 2 years ago
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆20Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- ☆17Updated 2 years ago
- ☆80Updated last year
- An even smaller speech recognizer / force aligner☆33Updated 6 months ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework☆47Updated 2 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆36Updated this week
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…☆44Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆115Updated 2 years ago
- Labeled data for homograph disambiguation☆59Updated 2 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- Uses ctypes and libespeak-ng to transform test into IPA phonemes☆20Updated last year
- Heteronym to Phoneme Parser☆18Updated last year
- Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr…☆28Updated last year
- 24-hour Automatic Speech Recognition☆27Updated 4 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆60Updated 2 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Updated 2 years ago
- ☆20Updated 6 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 6 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- Add n-gram and large language model (LLM) support to Whisper models.☆26Updated last month
- Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…☆18Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- Coqui Inference Engine☆40Updated 3 years ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆44Updated 4 years ago