chenxwh / seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
☆11Updated last year
Alternatives and similar repositories for seamless_communication:
Users that are interested in seamless_communication are comparing it to the libraries listed below
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆58Updated 10 months ago
- Generate video stories with AI ✨☆30Updated 4 months ago
- Seamless Voice Interactions with LLMs☆11Updated last year
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin…☆33Updated last year
- Style-Transfer: Apply the style of an image to another image☆52Updated 10 months ago
- ☆18Updated 4 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆43Updated 2 weeks ago
- ☆18Updated 8 months ago
- ☆16Updated last year
- Viral Factory is a highly modular gradio app that automates the production of various forms of social media content. Thanks to it's comp…☆42Updated last month
- Run AuraFlow on Replicate☆14Updated 6 months ago
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.☆57Updated last year
- [WIP] AI Try-On plugin for Chrome☆27Updated 10 months ago
- ☆43Updated 2 months ago
- The code for some apps built with Sieve.☆74Updated 2 months ago
- kokoro text to speech using javascript☆50Updated this week
- ☆19Updated last year
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆39Updated 5 months ago
- Unofficial package to easily interact with the Kits.AI API☆10Updated 10 months ago
- Docker image for the Text Generation Web UI: A Gradio web UI for Large Language Models. Supports Transformers, AWQ, GPTQ, llama.cpp (GGUF…☆1Updated 5 months ago
- ✨ Experience the enchantment of Story Blocks: an open-source project merging AI text generation and image synthesis to create captivating…☆55Updated last year
- ☆24Updated 3 weeks ago
- A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.☆40Updated last year
- Generate visual podcasts about novels using open source models☆24Updated last year
- ☆12Updated 2 months ago
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework☆18Updated 8 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated 10 months ago
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆35Updated last year
- ImageBind One Embedding Space to Bind Them All☆20Updated last year
- ☆78Updated last year