onnxruntime / Whisper-HybridLoop-Onnx-Demo
☆14 Updated 8 months ago
Related projects
Alternatives and complementary repositories for Whisper-HybridLoop-Onnx-Demo
- ONNX Inference of Pyannote Segmentation ☆66 Updated 2 months ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese, German and Ea… ☆13 Updated 3 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code ☆197 Updated 2 years ago
- Collection of Open Source Speech Data ☆146 Updated 2 weeks ago
- C++ library for converting text to phonemes for Piper ☆89 Updated 8 months ago
- proof of concept conversation orchestrator with a speech-language model ☆14 Updated last month
- Lyra V2 (SoundStream) running in the browser ☆18 Updated last year
- Onnx wrapper for espnet inference model ☆156 Updated last month
- ONNX implementation of Whisper. PyTorch free. ☆85 Updated this week
- openvino version of openai/whisper ☆12 Updated last month
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile… ☆30 Updated 2 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆45 Updated last week
- Export an ONNX graph that performs ISTFT. Designed for TTS models. ☆19 Updated 6 months ago
- Fine-Tune Whisper with Transformers and PEFT ☆38 Updated last year
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC ☆21 Updated last year
- ☆254 Updated 5 months ago
- C++ version of pyannote audio speaker diarization pipeline ☆18 Updated 9 months ago
- Kaldi-compatible online fbank extractor without external dependencies ☆80 Updated 3 weeks ago
- On-device voice activity detection (VAD) powered by deep learning ☆179 Updated last week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆84 Updated 6 months ago
- VoiceBox neural network implementation ☆96 Updated 3 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆105 Updated last year
- NPTEL2020: Speech2Text dataset for Indian-English Accent ☆72 Updated 2 years ago
- Colab notebooks for Next-gen Kaldi ☆26 Updated 3 weeks ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization an ONNX runtime port… ☆12 Updated 2 months ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS ☆125 Updated 2 years ago
- Coqui Inference Engine ☆38 Updated 3 years ago
- streaming speech to text server using Whisper ☆83 Updated last year
- A library for adding punctuation into a text from ASR. ☆17 Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework ☆57 Updated last year