lukerbs / pytoon
PyToon is a Python based animation library for automatically animating a cartoon character's mouth movements and bodily expressions to sync with an audio recording of someone talking. PyToon uses machine learning based audio analysis techniques to automatically generate lip-synced character animations (see "Example Output Video" in README).
☆13Updated last month
Related projects ⓘ
Alternatives and complementary repositories for pytoon
- Official repository of Tapir Lab.'s Lip-Sync Method☆9Updated last year
- Project Page for VividTalk☆16Updated 11 months ago
- a naive 3d human pose editor GUI.☆18Updated last year
- lightweight LAMA inference wrapper☆24Updated last year
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆27Updated 10 months ago
- ☆10Updated 10 months ago
- Modify-Anything is based on yolov5,yolov8 for video and image detection. Segment-anything,lama_cleaner is applied to segment, modify, era…☆10Updated last year
- Generate images from an initial frame and text☆37Updated last year
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆22Updated 4 months ago
- A pipeline to generate user-preferred photo-realistic avatars using stable-diffusion and bayesian-optimization.☆18Updated 2 weeks ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated 7 months ago
- ESRGAN (Enhanced Super Resolution GAN) using two 2x2 (kernel size) conv2d layers instead of a traditional single 3x3 conv2d layer in its …☆13Updated 2 years ago
- An library for editing and rendering motion of 3D characters with deep learning.☆10Updated last year
- Official code and dataset release for "JAFPro: Joint Appearance Fusion and Propagation for Human Video Motion Transfer from Multiple Refe…☆14Updated 3 years ago
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation☆12Updated last year
- Animatediff implementation. Includes a ControlNet pipeline.☆19Updated 10 months ago
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation☆14Updated last year
- ☆15Updated 11 months ago
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023☆30Updated last year
- ☆23Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 3 weeks ago
- FlexiFilm: Long Video Generation with Flexible Conditions☆32Updated 6 months ago
- ☆21Updated 3 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆33Updated last year
- CPU inference version of VisemeNet-tensorflow☆13Updated 5 years ago