korakoe / VALL-E-XLinks
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
☆15Updated last year
Alternatives and similar repositories for VALL-E-X
Users that are interested in VALL-E-X are comparing it to the libraries listed below
Sorting:
- Generate images from an initial frame and text☆36Updated 2 years ago
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆24Updated last year
- ☆16Updated last year
- Music production for silent film clips.☆28Updated 6 months ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆13Updated 2 years ago
- FlexiFilm: Long Video Generation with Flexible Conditions☆31Updated last year
- ☆18Updated last year
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆19Updated 9 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated last month
- General Prior for Anime - 1☆44Updated 2 years ago
- ☆31Updated last year
- ☆20Updated last year
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆24Updated last year
- ☆23Updated 2 months ago
- ☆13Updated last year
- ☆26Updated last year
- ☆45Updated 11 months ago
- ☆19Updated last year
- ☆11Updated last year
- ☆40Updated last year
- ☆21Updated last year
- ☆16Updated 2 years ago
- ☆14Updated last year
- ☆23Updated last year
- Vid Driven Portrait Animation 🤢😷☆18Updated last year
- ☆31Updated last year
- ☆31Updated last year
- Animatediff implementation. Includes a ControlNet pipeline.☆18Updated last year
- implementation of AnimateDiff.☆31Updated 2 years ago
- STDFormer: Spatio Temporal Disentanglement Learning for 3D Human Mesh Recovery from Monocular Videos with Transformer☆45Updated last year