UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions
☆47Dec 16, 2025Updated 2 months ago
Alternatives and similar repositories for Sora2-mini
Users that are interested in Sora2-mini are comparing it to the libraries listed below
Sorting:
- [CVPR 2026] Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation☆55Dec 16, 2025Updated 2 months ago
- [AAAI 2025] Video Diffusion Models are Strong Video Inpainter☆17Jul 21, 2025Updated 7 months ago
- ☆16Oct 4, 2024Updated last year
- CVPR2025-3D Gaussian Head Avatars with Expressive Dynamic Appearances by Compact Tensorial Representations☆36Sep 3, 2025Updated 6 months ago
- Spatial Temporal Graph Convolutional Networks for Sign Language (ST-GCN-SL) Recognition☆21Feb 12, 2024Updated 2 years ago
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023☆30Jul 29, 2023Updated 2 years ago
- ☆34Dec 16, 2025Updated 2 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- ☆30Jun 30, 2025Updated 8 months ago
- Omni Controllable Video Diffusion☆41Dec 22, 2025Updated 2 months ago
- Unoffical LivePortrait Training Script [ 🚧 Under Construction]☆38Jan 28, 2025Updated last year
- Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation☆110Feb 27, 2026Updated last week
- Code for the paper "IFFNeRF: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model"☆12May 26, 2024Updated last year
- ☆62Jul 1, 2025Updated 8 months ago
- OpenTMA: support text-motion alignment for HumanML3D, Motion-X, and UniMoCap☆46May 22, 2024Updated last year
- ☆37Dec 18, 2025Updated 2 months ago
- official implementation of [PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning, ICCV'25]☆35Oct 31, 2025Updated 4 months ago
- LTX-Video-Trainer-GUI 是为LTX视频lora模型训练提供的GUI工具,支持通过简单的界面训练 LoRA 模型用于视频生成。本训练器提供了直观的 GUI 界面,使用户能够轻松设置和启动训练流程,无需编写复杂代码。☆13Jul 18, 2025Updated 7 months ago
- PyTorch Implementation of "Mask2Hand: Learning to Predict the 3D Hand Pose and Shape from Shadow"☆12Aug 19, 2024Updated last year
- ☆12Aug 14, 2018Updated 7 years ago
- [WACV 2025] "Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression"☆19Oct 14, 2025Updated 4 months ago
- BEGANSing - Korean SVS + SVC + AudioSR☆11Feb 17, 2024Updated 2 years ago
- ☆10Jan 26, 2025Updated last year
- ☆14Feb 22, 2025Updated last year
- Implementation of Differential Learning Rate in Keras☆11Jun 4, 2019Updated 6 years ago
- ☆63Dec 1, 2025Updated 3 months ago
- [AAAI'26] PyTorch code for our paper "QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution"☆32Jan 29, 2026Updated last month
- AI-powered video object removal (diffusion inpainting under the hood).☆15Oct 23, 2025Updated 4 months ago
- [🔥ACM MM2025] EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation☆23Dec 30, 2025Updated 2 months ago
- ☆18May 15, 2025Updated 9 months ago
- Inverse Kinematics for MANO hands☆18Feb 23, 2022Updated 4 years ago
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- GUI for pitch correction and audio synthesis using NSF-HiFiGAN neural vocoders.☆24Updated this week
- [ICCV 2021] The official repo for the paper "Speech Drives Templates: Co-Speech Gesture Synthesis with Learned Templates".☆97Jun 3, 2023Updated 2 years ago
- ☆48Aug 10, 2023Updated 2 years ago
- Official PyTorch implementation of the CVPR 2024 Highlight Paper "Real-time 3D-aware Portrait Video Relighting"☆63Oct 23, 2024Updated last year
- ☆15Oct 30, 2024Updated last year
- Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation[TNNLS2024]☆13May 6, 2025Updated 10 months ago
- Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs☆33Dec 9, 2025Updated 3 months ago