solyarisoftware / CoquiSTTJsLinks
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
☆30Updated 4 years ago
Alternatives and similar repositories for CoquiSTTJs
Users that are interested in CoquiSTTJs are comparing it to the libraries listed below
Sorting:
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Faster Tortoise inference then Tortoise Fast Fork☆127Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Updated 3 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated 3 weeks ago
- zero-shot realtime TTS system, fully offline, free and open source☆48Updated 6 months ago
- Uses ctypes and libespeak-ng to transform test into IPA phonemes☆24Updated 2 years ago
- SoTA open-source TTS☆107Updated 5 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated 2 years ago
- Speaker Diarization with Transformers☆69Updated 5 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆44Updated 2 months ago
- ☆17Updated 4 years ago
- 🐸STT integration examples☆129Updated 3 years ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Updated 7 months ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆24Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- ☆68Updated last month
- High quality text-to-speech based on StyleTTS 2.☆69Updated last week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your language☆41Updated this week
- ☆29Updated last year
- ☆43Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆152Updated last year
- Community framework for training tortoise☆44Updated 3 years ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆20Updated last month
- Launch your speech synthesis within one minute.☆12Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆126Updated 3 months ago
- ☆50Updated this week
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆63Updated 2 years ago