fabio-sim / Fast-SeamlessM4T-ONNX
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
☆42Updated last year
Alternatives and similar repositories for Fast-SeamlessM4T-ONNX:
Users that are interested in Fast-SeamlessM4T-ONNX are comparing it to the libraries listed below
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆83Updated 4 months ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆39Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆92Updated 9 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆24Updated 6 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆58Updated last year
- ONNX Inference of Pyannote Segmentation☆80Updated last month
- Running the F5-TTS by ONNX Runtime☆104Updated last week
- Port of Funasr's Paraformer model in C/C++☆28Updated 8 months ago
- Chinese and English Bilinguish G2P☆20Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆20Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆93Updated 4 months ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Updated 2 years ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆52Updated last year
- A lightweight end-to-end text-to-speech model☆102Updated last month
- The paddle implementation of meta's LLaMA.☆45Updated last year
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated last year
- flow mirror models from JZX AI Labs☆42Updated 4 months ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆40Updated last month
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆58Updated 6 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆60Updated last year
- Putting flows on top of neural transducers for better TTS☆62Updated this week
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆129Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆109Updated last year
- paraformer(chinense asr) online onnx runtime for python☆40Updated 10 months ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆78Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆76Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆67Updated last year
- ChatTTS is a generative speech model for daily dialogue.☆14Updated 4 months ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆199Updated 2 years ago