Official implementation of LaVin-DiT
☆53Jan 27, 2025Updated last year
Alternatives and similar repositories for LaVin-DiT
Users that are interested in LaVin-DiT are comparing it to the libraries listed below
Sorting:
- KMM: Key Frame Mask Mamba for Extended Motion Generation☆19Sep 22, 2025Updated 5 months ago
- Uni-OVSeg is a weakly supervised open-vocabulary segmentation framework that leverages unpaired mask-text pairs.☆53Jun 11, 2024Updated last year
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆39Jan 5, 2026Updated 2 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]☆15Jul 15, 2025Updated 7 months ago
- ☆31Sep 1, 2025Updated 6 months ago
- PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation☆55Jan 5, 2026Updated 2 months ago
- Real-time meshing the contours of 3D scalar fields with the dual contouring algorithm in plain C++ and OpenGL/CUDA. Along with seam handl…☆17May 9, 2024Updated last year
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆14Apr 2, 2025Updated 11 months ago
- Create your own 3D scene with words anywhere.☆32Updated this week
- ☆15Apr 13, 2023Updated 2 years ago
- ☆17Apr 17, 2025Updated 10 months ago
- Pytorch implementation for Deep Visual Hull Prior☆15Feb 17, 2021Updated 5 years ago
- ☆42Sep 15, 2025Updated 5 months ago
- ☆18Oct 20, 2024Updated last year
- ☆16Sep 16, 2025Updated 5 months ago
- A clean Pytorch Implementation of Mean Flow, with FID evaluation on the fly☆55Sep 21, 2025Updated 5 months ago
- Siggraph 2025 Journal track☆23Aug 13, 2025Updated 6 months ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆21Oct 8, 2024Updated last year
- ☆18May 2, 2024Updated last year
- This is the official implementation of ICLR 2024 paper "VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimod…☆19Feb 24, 2025Updated last year
- MegaRAG: Multimodal Graph-based RAG☆37Sep 16, 2025Updated 5 months ago
- Official Implementation for Diffusion Models Without Classifier-free Guidance☆171Feb 18, 2025Updated last year
- DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder☆179Oct 5, 2025Updated 5 months ago
- ☆31Mar 24, 2023Updated 2 years ago
- ☆27Jun 18, 2025Updated 8 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆30Oct 9, 2025Updated 5 months ago
- Towards Defending against Adversarial Examples via Attack-Invariant Features☆12Oct 12, 2023Updated 2 years ago
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]☆21Aug 21, 2025Updated 6 months ago
- A PyTorch implementation of Minimal-IK to restore SMPL parameters from skeleton/keypoints + point cloud, with VPoser pose prior for bette…☆22Dec 28, 2023Updated 2 years ago
- Simple script to parallelize download and extract files for SA-1B Dataset.☆38Jan 15, 2026Updated last month
- libpgo: Library for Physically based Simulation (P), Geometric Shape Modeling (G), and Optimization (O)☆26Jan 26, 2026Updated last month
- [ICML 2025] Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion☆34Nov 10, 2025Updated 3 months ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Jun 25, 2024Updated last year
- ☆26Jun 2, 2025Updated 9 months ago
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders☆42Jun 10, 2025Updated 8 months ago
- Unifying Specialized Visual Encoders for Video Language Models☆25Nov 22, 2025Updated 3 months ago
- ☆33Jul 15, 2025Updated 7 months ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆30Jan 10, 2026Updated last month