chenxwh / seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
☆10 Updated last year
Related projects
Alternatives and complementary repositories for seamless_communication
- ☆18 Updated 2 months ago
- Auto-Video maker handling many AI's ☆12 Updated 8 months ago
- ☆54 Updated 10 months ago
- ☆17 Updated 11 months ago
- Generate video stories with AI ✨ ☆28 Updated 2 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on … ☆73 Updated this week
- ☆77 Updated 4 months ago
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints. ☆55 Updated last year
- ☆35 Updated last week
- ☆27 Updated last year
- ☆15 Updated last year
- ☆37 Updated 10 months ago
- [WIP] AI Try-On plugin for Chrome ☆25 Updated 8 months ago
- This project aims to bring a more stable and user friendly check GPT interface designed to allow others to implement their own GPT prompt… ☆11 Updated last year
- ☆78 Updated 11 months ago
- ☆19 Updated 10 months ago
- One-click launcher for Audiocraft MusicGen + AudioGen Gradio Web UI ☆63 Updated last year
- A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files. ☆38 Updated 11 months ago
- ☆31 Updated 11 months ago
- ☆11 Updated last year
- ☆25 Updated 11 months ago
- ☆24 Updated 11 months ago
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0 ☆46 Updated 5 months ago
- ☆24 Updated 11 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆30 Updated 3 weeks ago
- ☆19 Updated 8 months ago
- Run AuraFlow on Replicate ☆13 Updated 4 months ago
- ☆46 Updated 2 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files ☆29 Updated this week
- Site for sharing Bark voices ☆48 Updated 4 months ago