korakoe / VALL-E-X
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
☆16 Updated last year
Alternatives and similar repositories for VALL-E-X
Users who are interested in VALL-E-X are comparing it to the repositories listed below.
- Generate images from an initial frame and text ☆37 Updated 2 years ago
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation ☆24 Updated last year
- Music production for silent film clips. ☆29 Updated 7 months ago
- General Prior for Anime - 1 ☆44 Updated 2 years ago
- FlexiFilm: Long Video Generation with Flexible Conditions ☆31 Updated last year
- ☆19 Updated last year
- Talking head animation ☆28 Updated last year
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull ☆13 Updated 2 years ago
- ☆16 Updated last year
- Vid Driven Portrait Animation 🤢😷 ☆18 Updated last year
- ☆31 Updated last year
- Let's try and finetune the OpenAI consistency decoder to work for SDXL ☆24 Updated 2 years ago
- ☆32 Updated last year
- ☆24 Updated 2 months ago
- ☆16 Updated 2 years ago
- ☆13 Updated last year
- ☆20 Updated last year
- ☆14 Updated last year
- Animatediff implementation. Includes a ControlNet pipeline. ☆19 Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 Updated 2 months ago
- ☆18 Updated last year
- Official implementation for "Nested Attention: Semantic-aware Attention Values for Concept Personalization" [SIGGRAPH 2025] ☆27 Updated 3 months ago
- Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested. ☆54 Updated last year
- animatediff prompt travel ☆19 Updated last year
- STDFormer: Spatio Temporal Disentanglement Learning for 3D Human Mesh Recovery from Monocular Videos with Transformer ☆45 Updated last year
- ☆40 Updated last year
- ☆13 Updated last year
- finetune your florence2 model easy ☆21 Updated last year
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription. ☆34 Updated 8 months ago
- implementation of AnimateDiff. ☆32 Updated 2 years ago