riteshhere / Speaker_diarization
Speech Diarization for scrum automation
☆94Updated last year
Related projects: ⓘ
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆163Updated last week
- A lightweight end-to-end text-to-speech model☆79Updated this week
- Live-Transcription (STT) with Whisper PoC☆140Updated 3 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆117Updated last month
- Have a natural voice conversation with an LLM☆189Updated this week
- zero-shot voice conversion with in context learning☆135Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆81Updated 4 months ago
- ☆166Updated 9 months ago
- OpenAI API and Whisper based Video Translation☆66Updated 5 months ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆48Updated 2 weeks ago
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆116Updated 5 months ago
- ⚡️ 80x faster language detection with Fasttext | Split text by language for TTS☆104Updated this week
- The full experience of chatting with your favourite news website.☆108Updated 9 months ago
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆78Updated 6 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆97Updated 7 months ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆151Updated 5 months ago
- ☆166Updated last month
- Voice Transformation for Videos. 🎤👄🎬☆202Updated 4 months ago
- ☆49Updated last month
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆298Updated 2 months ago
- MacOS Agent: A Simplified Assistant for Your Mac☆47Updated last month
- Fine tune an LLM using Replicate☆51Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆56Updated 4 months ago
- Llama3.1 learns to Listen☆134Updated this week
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆43Updated 3 months ago
- ☆28Updated 2 months ago
- ez audio transcription tool with flexible processing and post-processing options☆122Updated 7 months ago
- Tell a story and get a live feed of images.☆130Updated 5 months ago
- experiments with different llms☆30Updated 2 weeks ago
- Filter X content using LLM API requests, configurable, based on Groq API☆125Updated last month