DongKeon / webrtc-whisper-asrLinks
WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.
☆12Updated 9 months ago
Alternatives and similar repositories for webrtc-whisper-asr
Users that are interested in webrtc-whisper-asr are comparing it to the libraries listed below
Sorting:
- A lightweight Python library for running TTS models with a unified API.☆20Updated 4 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated 3 months ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆63Updated last month
- ☆25Updated 2 weeks ago
- Open TTS models, built for streaming on the edge☆43Updated 4 months ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆34Updated 6 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆59Updated this week
- ☆16Updated 4 months ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆20Updated 4 months ago
- Speaker diarization service☆23Updated 3 weeks ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆22Updated this week
- Transcription and annotation interface for recorded audio or video files☆35Updated this week
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 2 months ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated 10 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆29Updated 2 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆19Updated last month
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆35Updated last month
- A curated list of awesome voice activity detection☆59Updated 7 months ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated 2 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆19Updated last month
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆15Updated 9 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 6 months ago
- Normalize Text in Russian☆27Updated last year
- ☆51Updated 2 weeks ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆33Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆28Updated 11 months ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆30Updated this week