IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
☆58Sep 26, 2024Updated last year
Alternatives and similar repositories for IMProv
Users who are interested in IMProv are comparing it to the repositories listed below
- ☆49Nov 28, 2024Updated last year
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆28Aug 19, 2025Updated 6 months ago
- Syphus: Automatic Instruction-Response Generation Pipeline☆14Dec 14, 2023Updated 2 years ago
- [NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"☆183Mar 4, 2024Updated 2 years ago
- [NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"☆105Nov 9, 2023Updated 2 years ago
- [TMLR 2025] The official repository of the paper "Unsupervised Discovery of Object-Centric Neural Fields"☆18Feb 15, 2026Updated 2 weeks ago
- A minimal and stable PPO.☆146Feb 9, 2024Updated 2 years ago
- LLMBind: A Unified Modality-Task Integration Framework☆19Jun 16, 2024Updated last year
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆37Oct 18, 2023Updated 2 years ago
- Directed masked autoencoders☆14Feb 20, 2026Updated last week
- DexPoint: Generalizable Point Cloud Reinforcement Learning for Sim-to-Real Dexterous Manipulation, CoRL 2022☆100May 22, 2024Updated last year
- a starter-kit for jaynes, the cloud-agnostic launch library☆17Aug 20, 2024Updated last year
- Official implementation and data release of the paper "Visual Prompting via Image Inpainting".☆318Aug 7, 2023Updated 2 years ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Dec 9, 2023Updated 2 years ago
- [WACV 2026] PyTorch code for 4D-Animal.☆27Nov 18, 2025Updated 3 months ago
- The Structure and Interpretation of Deep Networks Handbook☆14Dec 14, 2024Updated last year
- A curated list of papers & resources linked to concept learning☆12Aug 9, 2023Updated 2 years ago
- BH hackathon☆14Apr 4, 2024Updated last year
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"☆38Oct 9, 2025Updated 4 months ago
- [CVPR23] "Understanding and Improving Visual Prompting: A Label-Mapping Perspective" by Aochuan Chen, Yuguang Yao, Pin-Yu Chen, Yihua Zha…☆53Sep 17, 2023Updated 2 years ago
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆30Apr 27, 2024Updated last year
- UnrealBakedSDF is a sample Unreal project for importing and visualizing BakedSDF meshes.☆15Jun 14, 2023Updated 2 years ago
- ☆32Jan 17, 2026Updated last month
- An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io☆16Apr 18, 2024Updated last year
- ☆48Apr 25, 2024Updated last year
- 🕊️ HATO: Learning Visuotactile Skills with Two Multifingered Hands☆165May 27, 2024Updated last year
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆71Jan 29, 2024Updated 2 years ago
- Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"☆203Sep 18, 2025Updated 5 months ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆32May 15, 2023Updated 2 years ago
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆14Nov 21, 2025Updated 3 months ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations☆86Dec 12, 2022Updated 3 years ago
- Isaac Gym Python Stubs for Code Completion☆126Jun 10, 2024Updated last year
- ☆38May 15, 2025Updated 9 months ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆33Aug 11, 2022Updated 3 years ago
- ☆13Sep 4, 2023Updated 2 years ago
- Official implementation of HiM2SAM (PRCV25).☆25Aug 30, 2025Updated 6 months ago
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation☆128Oct 8, 2024Updated last year
- [ICCV 2023 Workshop] The Official Implementation of The First Prize Solution for RVOS Competition☆14Jan 1, 2024Updated 2 years ago