liruilong940607 / mintLinks
Multi-modal Content Creation Model Training Infrastructure including the FACT model (AI Choreographer) implementation.
☆11Updated 3 years ago
Alternatives and similar repositories for mint
Users that are interested in mint are comparing it to the libraries listed below
Sorting:
- [ICCV 2023] Controllable Person Image Synthesis with Pose‑Constrained Latent Diffusion☆41Updated last year
- ☆26Updated 9 months ago
- [CVPR 2022] Exploring Dual-task Correlation for Pose Guided Person Image Generation☆98Updated last year
- [CVPR 2022] Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning☆192Updated 3 years ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆42Updated 2 years ago
- [CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning☆40Updated 3 weeks ago
- [ICLR 2023] Towards Smooth Video Composition☆85Updated 2 years ago
- Official implementation of "TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts (ECCV2022…☆121Updated last year
- ☆92Updated 2 years ago
- Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation☆48Updated last year
- [CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial …☆265Updated last year
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆136Updated last year
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Updated 3 years ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆108Updated last year
- Official Repository for "Diffusion HPC: Generate Synthetic Data for Human Mesh Recovery in Challenging Domains" (3DV 2024 Spotlight)☆43Updated 2 years ago
- [CVPR2024] MotionEditor is the first diffusion-based model capable of video motion editing.☆177Updated 4 months ago
- ☆126Updated last year
- ☆33Updated 2 years ago
- Update-to-data resources for conditional content generation, including human motion generation, image or video generation and editing.☆276Updated last year
- [ECCV 2022] FashionViL: Fashion-Focused V+L Representation Learning☆60Updated 2 years ago
- Official pytorch implementation of Action-GPT☆116Updated 2 years ago
- CVPR2023 paper☆52Updated 2 years ago
- ☆51Updated last year
- [ICCV 2023] The official implementation of paper "HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation"☆301Updated last year
- Supercharged BLIP-2 that can handle videos☆120Updated last year
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation☆68Updated last year
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆32Updated 5 months ago
- Video dataset dedicated to portrait-mode video recognition.☆52Updated 8 months ago
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆33Updated 9 months ago
- Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"☆63Updated 2 years ago