CypherousSkies / reading-for-listenersLinks
A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!
☆23Updated 4 months ago
Alternatives and similar repositories for reading-for-listeners
Users that are interested in reading-for-listeners are comparing it to the libraries listed below
Sorting:
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- My guide to create an italian TTS with Coqui☆14Updated 3 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆20Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 10 months ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆26Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Evaluation of STT models for german language☆15Updated 3 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- ☆17Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆26Updated 2 years ago
- scipts for working with open.bible data☆24Updated 3 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆24Updated last year
- ☆17Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr…☆28Updated last year
- ☆11Updated 3 years ago
- Finally, some decent sample sentences☆23Updated last year
- Coqui Inference Engine☆40Updated 3 years ago
- Onnx compatible styletts2 code☆12Updated 2 weeks ago
- ☆56Updated 2 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆21Updated 4 years ago
- ☆17Updated 4 years ago
- Official PyTorch implementation of TTS Style Transfer☆23Updated 3 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated 4 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
- phone inventory library☆16Updated 2 years ago
- ☆32Updated 3 years ago
- ☆22Updated 3 years ago