daanelson / cog-whisperxLinks
☆10Updated 2 years ago
Alternatives and similar repositories for cog-whisperx
Users that are interested in cog-whisperx are comparing it to the libraries listed below
Sorting:
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Updated last year
- ☆69Updated 6 months ago
- ☆12Updated last year
- ☆14Updated 11 months ago
- Cog wrapper for collabora/WhisperSpeech☆24Updated last year
- ☆28Updated last year
- ☆55Updated last year
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- ASR + diarization model server with speculative decoding☆63Updated last year
- All the world is a play, we are but actors in it.☆50Updated 3 months ago
- ☆47Updated last year
- Gradio UI for a Cog API☆69Updated last year
- Auto-Video maker handling many AI's☆10Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆17Updated this week
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆36Updated 7 months ago
- ☆19Updated last year
- ☆20Updated last year
- ☆174Updated last year
- Cog wrapper for Coqui / xtts-v2☆78Updated 11 months ago
- Gradio app to track objects in video and add visual effects☆17Updated 3 months ago
- Play.ht's Text to Speech API☆92Updated 2 months ago
- ☆83Updated last year
- Incredibly descriptive audiovisual summaries for videos☆40Updated last year
- ☆13Updated last year
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆61Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations☆47Updated 7 months ago
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆17Updated 4 months ago
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆47Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year