gumblex / whisper_vad
Whisper.cpp Speech-to-text with Voice Activity Detection
☆12 Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for whisper_vad
- ez audio transcription tool with flexible processing and post-processing options ☆130 Updated 9 months ago
- Port of Funasr's Paraformer model in C/C++ ☆25 Updated 5 months ago
- Inference TinyLlama models on ncnn ☆25 Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. ☆9 Updated 4 months ago
- Experiments to test different speech recognition systems for SEPIA Framework ☆57 Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles ☆75 Updated last year
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow ☆22 Updated 2 months ago
- ONNX-compatible Fast SeamlessM4T: Massively Multilingual & Multimodal Machine Translation ☆40 Updated last year
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di… ☆64 Updated this week
- The inference code of RVC-Boss/GPT-SoVITS that can be developer-friendly. ☆11 Updated last month
- Run baby Llama 2 model in windows ☆13 Updated last year
- Port of Funasr's Sense-voice model in C/C++ ☆163 Updated this week
- Running the F5-TTS by ONNX Runtime ☆39 Updated this week
- EdgeInfer enables efficient edge intelligence by running small AI models, including embeddings and OnnxModels, on resource-constrained de… ☆38 Updated 7 months ago
- Semantic Search demo featuring UForm, USearch, UCall, and StreamLit, to visual and retrieve from image datasets, similar to "CLIP Retriev… ☆40 Updated 10 months ago
- Download full or partial git-lfs repos without temporarily using 2x disk space ☆30 Updated last year
- Web browser version of StarCoder.cpp ☆43 Updated last year
- ☆12 Updated 3 years ago
- Speech Diarization for scrum automation ☆97 Updated last year
- LiveKit SDK for Embedded ☆21 Updated 3 weeks ago
- Streaming ASR and TTS based on FastAPI + sherpa-onnx ☆39 Updated last month
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆53 Updated 10 months ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots ☆27 Updated 10 months ago
- ☆10 Updated last year
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech ☆10 Updated 2 months ago
- WebUI for tcconfig and tc on Linux server. ☆14 Updated 6 months ago
- Python Audio Separator in Real Time using MDX-NET model ☆12 Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻 ☆56 Updated last year
- Efficient inference of large language models. ☆144 Updated this week
- A toolkit enhances PyTorch with specialized functions for low-bit quantized neural networks. ☆28 Updated 4 months ago