echogarden-project / echogardenLinks
Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice isolation, language detection and more.
β404Updated last month
Alternatives and similar repositories for echogarden
Users that are interested in echogarden are comparing it to the libraries listed below
Sorting:
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps π£οΈπ―β867Updated 6 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionβ827Updated 4 months ago
- Synchronize Whisper's timestamps over an existing accurate transcriptionβ157Updated last year
- ez audio transcription tool with flexible processing and post-processing optionsβ159Updated last year
- Voice Transformation for Videos. π€ππ¬β243Updated 3 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.β174Updated 2 years ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into diβ¦β298Updated 2 months ago
- Open source inference code for Rev's modelβ429Updated 5 months ago
- Free on-device web app for audio transcribing and rendering subtitlesβ270Updated 2 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsβ343Updated 10 months ago
- A speech recognition library running in the browser thanks to a WebAssembly build of Voskβ491Updated last year
- web based editor for subtitles and transcriptsβ141Updated last year
- Voice activity detector (VAD) for the browser with a simple APIβ1,621Updated last week
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannoteβ223Updated 7 months ago
- Transcription, forced alignment, and audio indexing with OpenAI's Whisperβ2,019Updated this week
- Synchronize SRT timestamps over an existing accurate transcriptionβ35Updated 10 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!β305Updated 10 months ago
- A Fast TTS Engineβ549Updated 8 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface β‘οΈβ378Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β237Updated last month
- Node.js bindings for OpenAI's Whisper. (C++ CPU version by ggerganov)β291Updated last year
- A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!β313Updated 10 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokensβ518Updated last year
- React / Vanilla JS Text to Speech with highlighting the words and sentences that are being spoken using audio files, text to speech API, β¦β171Updated 2 weeks ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.β1,113Updated last month
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includβ¦β790Updated 3 weeks ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and β¦β363Updated last month
- ππ§ A tool for creating ebooks with synchronized text and audio (EPUB3 with Media Overlays)β318Updated last year
- Whisper with Medusa headsβ859Updated last month
- π₯π₯ Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have.β602Updated last month