fabio-sim / Fast-SeamlessM4T-ONNX
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
☆40Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Fast-SeamlessM4T-ONNX
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆74Updated last month
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆84Updated last month
- flow mirror models from JZX AI Labs☆40Updated last month
- Running the F5-TTS by ONNX Runtime☆35Updated this week
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆52Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆56Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆44Updated 3 months ago
- audiolm-pytorch training code☆15Updated last year
- Pseudo Streaming SenseVoice with Hotwords☆85Updated 2 weeks ago
- ONNX Inference of Pyannote Segmentation☆66Updated 2 months ago
- A lightweight end-to-end text-to-speech model☆91Updated 2 months ago
- Port of Funasr's Paraformer model in C/C++☆25Updated 5 months ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆32Updated last week
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆126Updated last year
- VALL-E 2 reproduction☆87Updated 4 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆365Updated 2 months ago
- Chinese and English Bilinguish G2P☆20Updated last year
- ☆45Updated 4 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆50Updated 3 years ago
- Awesome TTS☆54Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆84Updated 6 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆65Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆18Updated 9 months ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆14Updated 3 weeks ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆61Updated last week
- Official Code for ParrotTTS☆42Updated last month
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English☆74Updated this week
- paraformer(chinense asr) online onnx runtime for python☆36Updated 7 months ago
- Grapheme-to-Phoneme lexicons for Chinese dialects☆66Updated 2 years ago