echogarden-project / echogarden
Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice isolation, language detection and more.
β359Updated this week
Alternatives and similar repositories for echogarden:
Users that are interested in echogarden are comparing it to the libraries listed below
- Voice Transformation for Videos. π€ππ¬β237Updated 6 months ago
- ez audio transcription tool with flexible processing and post-processing optionsβ149Updated last year
- web based editor for subtitles and transcriptsβ130Updated 8 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionβ690Updated 4 months ago
- Synchronize Whisper's timestamps over an existing accurate transcriptionβ149Updated 11 months ago
- Text to speech alignment using CTC forced alignmentβ279Updated last month
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsβ325Updated 5 months ago
- Synchronize SRT timestamps over an existing accurate transcriptionβ29Updated 5 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β213Updated 3 weeks ago
- Open source inference code for Rev's modelβ402Updated 2 weeks ago
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning modβ¦β506Updated this week
- A Fast TTS Engineβ494Updated 3 months ago
- A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!β307Updated 5 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps π£οΈπ―β837Updated 2 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into diβ¦β212Updated 2 months ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and β¦β273Updated this week
- Transcription, forced alignment, and audio indexing with OpenAI's Whisperβ1,858Updated last month
- This tool uses AI to evaluate your pronunciation.β276Updated last week
- Whisper command line client compatible with original OpenAI client based on CTranslate2.β1,007Updated 2 months ago
- Multilingual Automatic Speech Recognition with word-level timestamps and confidenceβ2,390Updated last month
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannoteβ203Updated 2 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.β112Updated last year
- π A forced aligner intended for synchronization of narrated textβ92Updated 2 years ago
- An API to transcribe audio with OpenAI's Whisper Large v3!β271Updated 5 months ago
- A GUI interface for Open AI Whisper based on Tauri and Sveltekitβ127Updated 5 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.β156Updated last year
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ159Updated 9 months ago
- Smart Whisper is a native Node.js addon designed for efficient and streamlined interaction with the whisper.cpp, with automatic model offβ¦β53Updated last week
- Node.js bindings for OpenAI's Whisper. (C++ CPU version by ggerganov)β273Updated 9 months ago
- streaming speech to text server using Whisperβ91Updated last year