solyarisoftware / CoquiSTTJs
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
☆30 · Updated 4 years ago
Alternatives and similar repositories for CoquiSTTJs
Users who are interested in CoquiSTTJs are comparing it to the libraries listed below.
- 🐸STT integration examples ☆129 · Updated 2 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆130 · Updated 10 months ago
- Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. Advanced audio processing. ☆251 · Updated last year
- Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server. ☆51 · Updated 4 years ago
- Web app for keyword spotting using TensorflowJS ☆73 · Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 · Updated 2 years ago
- Create an LJSpeech structured voice dataset on wave input ☆34 · Updated 11 months ago
- Faster Tortoise inference than Tortoise Fast Fork ☆128 · Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆120 · Updated last month
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆150 · Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆68 · Updated 2 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 · Updated last year
- 🐸TTS recipes for different datasets ☆86 · Updated 3 years ago
- An automatic speech recognition API ☆69 · Updated 3 weeks ago
- ☆41 · Updated last year
- An even smaller speech recognizer / force aligner ☆35 · Updated 8 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data ☆12 · Updated 5 months ago
- On-device voice activity detection (VAD) powered by deep learning ☆228 · Updated last month
- Community framework for training tortoise ☆44 · Updated 2 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation ☆132 · Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆325 · Updated 10 months ago
- Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation ☆256 · Updated 2 years ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆42 · Updated this week
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆119 · Updated 2 years ago
- Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesi… ☆41 · Updated 4 years ago
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆159 · Updated last year
- A curated list of awesome OpenAI's Whisper ☆101 · Updated last year
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re… ☆47 · Updated 2 years ago
- SoTA open-source TTS ☆86 · Updated 3 months ago
- [WIP] VoiceSmith makes training text to speech models easy. ☆225 · Updated 2 years ago