echogarden-project / echogardenLinks
Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice isolation, language detection and more.
β370Updated 3 weeks ago
Alternatives and similar repositories for echogarden
Users that are interested in echogarden are comparing it to the libraries listed below
Sorting:
- Open source inference code for Rev's modelβ404Updated 2 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps π£οΈπ―β851Updated 3 months ago
- ez audio transcription tool with flexible processing and post-processing optionsβ152Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsβ329Updated 7 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionβ741Updated 2 weeks ago
- Text to speech alignment using CTC forced alignmentβ300Updated 2 months ago
- web based editor for subtitles and transcriptsβ135Updated 10 months ago
- Synchronize SRT timestamps over an existing accurate transcriptionβ32Updated 7 months ago
- Voice Transformation for Videos. π€ππ¬β238Updated this week
- Synchronize Whisper's timestamps over an existing accurate transcriptionβ152Updated last year
- A Fast TTS Engineβ514Updated 4 months ago
- Local SRT/LLM/TTS Voicechatβ692Updated 8 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β219Updated 2 months ago
- A speech recognition library running in the browser thanks to a WebAssembly build of Voskβ454Updated last year
- Interface for OuteTTS models.β1,304Updated 3 weeks ago
- Real-Time Whisper Voice Recognition with vosk model feedback.β112Updated last year
- The subtitles and translations are generated in real-time and displayed as pop-ups.β166Updated 2 years ago
- Voice activity detector (VAD) for the browser with a simple APIβ1,391Updated last month
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into diβ¦β244Updated last week
- G2Pβ258Updated last month
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and β¦β309Updated 2 weeks ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ95Updated last year
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includβ¦β501Updated 3 weeks ago
- Accelerating faster-whisper single file processing by multiprocessing through parallelizationβ54Updated 2 years ago
- A python package to build AI-powered real-time audio applicationsβ1,336Updated 4 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokensβ497Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engineβ429Updated 9 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.β120Updated last year
- Transcription, forced alignment, and audio indexing with OpenAI's Whisperβ1,914Updated last month
- β1,134Updated 4 months ago