fabio-sim / Fast-SeamlessM4T-ONNX
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
☆43Updated last year
Alternatives and similar repositories for Fast-SeamlessM4T-ONNX:
Users that are interested in Fast-SeamlessM4T-ONNX are comparing it to the libraries listed below
- flow mirror models from JZX AI Labs☆44Updated 6 months ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆20Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 11 months ago
- Cantonese Text to Speech with VITS implementation☆29Updated 2 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆46Updated 2 weeks ago
- A lightweight end-to-end text-to-speech model☆112Updated last month
- openvino version of openai/whisper☆166Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆25Updated 8 months ago
- Port of Funasr's Paraformer model in C/C++☆31Updated 9 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆66Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆88Updated 6 months ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆111Updated last year
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆52Updated last year
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆107Updated this week
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated last year
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Updated 2 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated last year
- Chinese and English Bilinguish G2P☆20Updated last year
- Running the F5-TTS by ONNX Runtime☆142Updated last week
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆91Updated last year
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆84Updated 3 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆72Updated 7 months ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆34Updated 4 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆81Updated last year