John-SadJoe-Doe / synapse-hack-2024Links
☆0Updated last year
Alternatives and similar repositories for synapse-hack-2024
Users that are interested in synapse-hack-2024 are comparing it to the libraries listed below
Sorting:
- A server software for Minecraft: Bedrock Edition in PHP☆2Updated 4 years ago
- ☆30Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆33,864Updated 3 months ago
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,349Updated 6 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆6,610Updated 7 months ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆7,116Updated last year
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine☆8,125Updated 11 months ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆8,360Updated 4 months ago
- Foundational model for human-like, expressive TTS☆4,144Updated last year
- Inference and training library for high-quality TTS models.☆5,380Updated 8 months ago
- PhotoMaker [CVPR 2024]☆10,060Updated 9 months ago
- Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person☆5,927Updated last year
- [SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation☆5,874Updated 4 months ago
- ☆28Updated last year
- Your image is almost there!☆7,661Updated last year
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,277Updated 2 months ago
- ☆8Updated last year
- Instant voice cloning by MyShell.☆104Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆5,899Updated last year
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation☆10,350Updated 8 months ago
- [AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"☆6,379Updated last year
- An Open Source text-to-speech system built by inverting Whisper.☆4,329Updated 2 months ago
- Text-to-Music Generation with Rectified Flow Transformers☆1,709Updated 8 months ago
- Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions☆7,647Updated 11 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆49,686Updated last week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆12,891Updated 3 weeks ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,320Updated 9 months ago
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆13,088Updated last year
- Turn any glasses into AI-powered smart glasses☆3,712Updated last month
- Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation☆14,756Updated 6 months ago