Madhuvod / VoxLinguaLinks
A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video with the translated audio
β15Updated 8 months ago
Alternatives and similar repositories for VoxLingua
Users that are interested in VoxLingua are comparing it to the libraries listed below
Sorting:
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrievalβ13Updated 7 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library πβ20Updated 8 months ago
- All-in-one Speech Transcriptionβ10Updated this week
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.β15Updated 8 months ago
- Text-to-Speech Latency Benchmarkβ22Updated last week
- specifications and documentation for the Open Voice Interoperability Initiative Projectβ21Updated last week
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phonemeβ¦β23Updated 5 months ago
- a simple system for 2-way interruptible voice interactions between human and LLMβ30Updated last year
- Supervoice diffusion enhanceβ28Updated last year
- Arabic Grapheme-to-Phoneme (G2P) Conversionβ13Updated 10 months ago
- β18Updated 10 months ago
- chatterbox TTS + Voice Clone using onnxβ27Updated 3 weeks ago
- Soniox Compare. Compare real-time voice AI side by side. No glossy charts, just results.β18Updated 6 months ago
- A composition of offline tools to achieve high quality multilingual speech to text transcriptionβ23Updated last week
- Whisper finetuningβ15Updated 9 months ago
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Aiβ¦β21Updated last year
- β32Updated 3 months ago
- proof of concept conversation orchestrator with a speech-language model