fabio-sim / Fast-SeamlessM4T-ONNX
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
☆42Updated last year
Alternatives and similar repositories for Fast-SeamlessM4T-ONNX:
Users that are interested in Fast-SeamlessM4T-ONNX are comparing it to the libraries listed below
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆78Updated 3 months ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 4 months ago
- Running the F5-TTS by ONNX Runtime☆80Updated this week
- A lightweight end-to-end text-to-speech model☆99Updated 3 weeks ago
- flow mirror models from JZX AI Labs☆43Updated 3 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run; if…☆20Updated this week
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated last year
- Port of Funasr's Paraformer model in C/C++☆26Updated 6 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆114Updated this week
- ONNX Inference of Pyannote Segmentation☆81Updated 3 weeks ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆57Updated last year
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆38Updated 3 weeks ago
- An implementation of MeloTTS by onnxruntime☆15Updated 2 months ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆90Updated 3 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Updated 3 years ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆33Updated this week
- An LLM base TTS engine☆60Updated 3 weeks ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆148Updated 6 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆15Updated 4 months ago
- Putting flows on top of neural transducers for better TTS☆62Updated 3 weeks ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆52Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆81Updated 8 months ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- Nue-ASR inference code by rinna Co., Ltd.☆30Updated 5 months ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆27Updated last year