fabio-sim / Fast-SeamlessM4T-ONNX
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
☆43Updated 2 years ago
Alternatives and similar repositories for Fast-SeamlessM4T-ONNX
Users who are interested in Fast-SeamlessM4T-ONNX are comparing it to the libraries listed below
- openvino version of openai/whisper☆178Updated 2 years ago
- Port of Funasr's Paraformer model in C/C++☆39Updated last year
- ☆175Updated 2 years ago
- SenseVoice-python: An enterprise-grade open-source multi-language ASR system from FunASR, running on onnxruntime☆106Updated 2 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- A lightweight end-to-end text-to-speech model☆124Updated 9 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- Running the F5-TTS by ONNX Runtime☆184Updated last month
- Open models for Coqui STT☆148Updated 2 years ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Updated 2 years ago
- an improved version of Real-time-voice-cloning☆52Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆98Updated last year
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated last year
- Ultimate Vocal Remover application running on Linux (Ubuntu 16.04)☆54Updated 2 years ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆130Updated this week
- TTS with The Massively Multilingual Speech (MMS) project☆231Updated last year
- C++ version of pyannote audio speaker diarization pipeline☆22Updated last year
- ☆354Updated last year
- Putting flows on top of neural transducers for better TTS☆64Updated 2 weeks ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Updated 4 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated 2 years ago
- Barkify: an unofficial training implementation of Bark TTS by suno-ai☆128Updated 2 years ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆54Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆62Updated 2 years ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆48Updated last year
- ☆261Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆72Updated 4 months ago
- Speech Diarization for scrum automation☆111Updated 2 years ago