MCG-NJU / Sora2-miniLinks
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions
☆37Updated last month
Alternatives and similar repositories for Sora2-mini
Users that are interested in Sora2-mini are comparing it to the libraries listed below
Sorting:
- ☆53Updated last month
- Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset☆104Updated 2 months ago
- The official UniVerse-1 code.☆119Updated 3 months ago
- [ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers☆128Updated 7 months ago
- ☆63Updated last month
- ☆85Updated 3 months ago
- [CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation☆166Updated 2 months ago
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models☆89Updated 4 months ago
- Official implementation of "Towards One-Step Causal Video Generation via Adversarial Self-Distillation" (arXiv 2025). A novel framework f…☆22Updated 2 months ago
- Muti-human Interactive Talking Dataset☆67Updated 5 months ago
- The official SpeakerVid-5M data curation code.☆68Updated 6 months ago
- HyperMotion is a pose guided human image animation framework based on a large-scale video diffusion Transformer.☆132Updated 6 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆65Updated 8 months ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆44Updated 7 months ago
- Official implementation of "Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation" (ICCV 2…☆79Updated 5 months ago
- ☆25Updated last year
- [[NeurIPS 2025] UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions☆84Updated 6 months ago
- ☆91Updated last year
- Training-Free Text-Guided Image Editing Using Visual Autoregressive Model☆71Updated 9 months ago
- Official repository for HOComp: Interaction-Aware Human-Object Composition☆29Updated last month
- ☆85Updated 10 months ago
- The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation☆39Updated 8 months ago
- [CVPR'25 Highlight] Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis☆157Updated 9 months ago
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidance☆61Updated 3 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆415Updated 4 months ago
- [SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing☆26Updated 2 months ago
- VideoCoF: Unified Video Editing with Temporal Reasoner☆129Updated 3 weeks ago
- Benchmark dataset and code of MSRVTT-Personalization☆52Updated 2 months ago
- Official implementation of "HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment"☆99Updated 9 months ago
- [CVPR2024] MotionEditor is the first diffusion-based model capable of video motion editing.☆186Updated 4 months ago