fabio-sim / Fast-SeamlessM4T-ONNX
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
☆41Updated last year
Related projects: ⓘ
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆55Updated 2 weeks ago
- A lightweight end-to-end text-to-speech model☆79Updated this week
- We Speech Transcript based on LLM, in 300 lines of code.☆117Updated last month
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆83Updated this week
- Whisper realtime streaming for long speech-to-text transcription and translation☆97Updated 7 months ago
- Full version of wav2lip-onnx including face alignment and face enhancement and more...☆41Updated 5 months ago
- flow mirror models from JZX AI Labs☆33Updated this week
- VALL-E 2 reproduction☆72Updated 2 months ago
- ☆244Updated 6 months ago
- ☆97Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆81Updated 4 months ago
- ☆31Updated last month
- zero-shot voice conversion with in context learning☆135Updated this week
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆119Updated 2 months ago
- openvino version of openai/whisper☆157Updated 10 months ago
- ChatTTS is a generative speech model for daily dialogue.☆11Updated 2 weeks ago
- ONNX implementation of Whisper. PyTorch free.☆79Updated last month
- lina-speech : linear attention based text-to-speech☆111Updated 3 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆56Updated 4 months ago
- ☆166Updated 9 months ago
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.☆169Updated 3 weeks ago
- ONNX Inference of Pyannote Segmentation☆54Updated last week
- Llama3.1 learns to Listen☆134Updated this week
- ☆62Updated 4 months ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆51Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆43Updated 3 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆334Updated this week
- Port of Funasr's Paraformer model in C/C++☆25Updated 3 months ago
- Chinese and English Bilinguish G2P☆19Updated last year
- ☆61Updated last month