[CVPR 2025, Highlight] The official implementation of the paper "Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation"
☆28Jun 6, 2025Updated 11 months ago
Alternatives and similar repositories for InstaManip
Users that are interested in InstaManip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV2025] Training-Free Diffusion Models for Geometric Image Editing☆33Jan 13, 2026Updated 4 months ago
- ☆33Feb 18, 2026Updated 3 months ago
- [CVPR 2025] Beacon3D: Object-centric Evaluation for 3D Grounding-QA☆28Nov 25, 2025Updated 5 months ago
- ☆18Apr 2, 2026Updated last month
- [CVPR 2025] Official Repo of Paper "FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification"☆48Jun 6, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes☆29Mar 12, 2026Updated 2 months ago
- Official implementation of ICCV 2025 paper - DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization☆22Jul 13, 2025Updated 10 months ago
- Official code for "Activating Wider Areas in Image Super-Resolution"☆13Aug 22, 2024Updated last year
- ☆28Aug 19, 2025Updated 9 months ago
- [CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way☆47Oct 10, 2025Updated 7 months ago
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆48Jun 2, 2025Updated 11 months ago
- The code of Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks☆27Apr 10, 2024Updated 2 years ago
- ☆42Aug 16, 2024Updated last year
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Dec 9, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆38Feb 11, 2025Updated last year
- Python与机器学习方向,《深度学习》课程仓库☆14Mar 16, 2018Updated 8 years ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆46Nov 17, 2024Updated last year
- ☆94Mar 15, 2026Updated 2 months ago
- Unified layout planning and image generation, ICCV2025☆45Jan 19, 2026Updated 4 months ago
- CVPR 2025' Instruct-4DGS: Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation☆30Sep 21, 2025Updated 8 months ago
- [NeurIPS 2024] Official Code for the Paper "Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning"☆28Apr 8, 2025Updated last year
- ☆38Nov 24, 2025Updated 6 months ago
- [ACL 2026 Main] Revisit What You See: Revealing Visual Semantics in Vision Tokens to Guide LVLM Decoding☆25Nov 21, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆26Dec 8, 2024Updated last year
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- ☆13Apr 23, 2025Updated last year
- MXNet/Gluon implementation of the original (Gaussian) Variational Autoencoders (VAE)☆10Dec 22, 2017Updated 8 years ago
- EditAR: Unified Conditional Generation with Autoregressive Models (CVPR 2025)☆43Jun 13, 2025Updated 11 months ago
- Source code for the reproduction of RibbonFold paper.☆12May 10, 2025Updated last year
- ☆18Aug 7, 2025Updated 9 months ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆17Jun 20, 2023Updated 2 years ago
- PICABench: How Far Are We from Physically Realistic Image Editing?☆38Nov 5, 2025Updated 6 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The official pytorch implementation of DiffHarmony and DiffHarmony++.☆34Oct 29, 2024Updated last year
- ☆13Mar 8, 2024Updated 2 years ago
- Training, optimization and deployment of Object Detection model with dinov2 backbone for efficient inference on NVIDIA Jetson☆13Jul 26, 2025Updated 9 months ago
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model☆11Jan 3, 2018Updated 8 years ago
- Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors"☆11Apr 30, 2023Updated 3 years ago
- Python与机器学习方向,《决策树与集成算法》课程仓库☆25Jun 13, 2018Updated 7 years ago
- Python与机器学习方向,《Python Web高级开发》课程仓库☆22Jan 29, 2019Updated 7 years ago