xvjiarui/IMProv

xvjiarui / IMProv

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

☆57

Alternatives and similar repositories for IMProv

Users interested in IMProv are comparing it to the repositories listed below.

DannyTran123 / egopet
Official implementation of the paper "EgoPet: Egomotion and Interaction Data from an Animal's Perspective".
☆29 · Dec 15, 2025 · Updated 7 months ago
alhojel / visual_task_vectors
☆41 · Jul 19, 2024 · Updated 2 years ago
ZhangYuanhan-AI / visual_prompt_retrieval
[NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"
☆182 · Mar 4, 2024 · Updated 2 years ago
pufanyi / syphus
Syphus: Automatic Instruction-Response Generation Pipeline
☆14 · Dec 14, 2023 · Updated 2 years ago
amirbar / visual_prompting
Official implementation and data release of the paper "Visual Prompting via Image Inpainting".
☆319 · Aug 7, 2023 · Updated 2 years ago
yzqin / dexpoint-release
DexPoint: Generalizable Point Cloud Reinforcement Learning for Sim-to-Real Dexterous Manipulation, CoRL 2022
☆107 · May 22, 2024 · Updated 2 years ago
fortyfive-labs / jaynes-starter-kit
A starter kit for jaynes, the cloud-agnostic launch library
☆17 · Apr 1, 2026 · Updated 3 months ago
ToruOwO / minimal-stable-PPO
A minimal and stable PPO.
☆148 · Feb 9, 2024 · Updated 2 years ago
OPTML-Group / ILM-VP
[CVPR23] "Understanding and Improving Visual Prompting: A Label-Mapping Perspective" by Aochuan Chen, Yuguang Yao, Pin-Yu Chen, Yihua Zha…
☆52 · Sep 17, 2023 · Updated 2 years ago
ToruOwO / hato
🕊️ HATO: Learning Visuotactile Skills with Two Multifingered Hands [ICRA 2025]
☆172 · May 27, 2024 · Updated 2 years ago
mlpc-ucsd / MasQCLIP
(ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation
☆37 · Oct 18, 2023 · Updated 2 years ago
yzqin / isaacgym-stubs
Isaac Gym Python Stubs for Code Completion
☆126 · Jun 10, 2024 · Updated 2 years ago
joyhsu0504 / LEFT
☆50 · Apr 25, 2024 · Updated 2 years ago
osheraz / allsight
AllSight is an optical tactile sensor with a round 3D structure, potentially designed for robotic in-hand manipulation tasks
☆20 · Nov 28, 2025 · Updated 7 months ago
x-robotics-lab / minbc
MinBC - Minimal Behavior Cloning
☆36 · Jul 5, 2026 · Updated 2 weeks ago
heaplax / ARMAP
☆29 · Jun 5, 2025 · Updated last year
SeanJia / CoTPC
Chain-of-Thought Predictive Control
☆56 · May 1, 2023 · Updated 3 years ago
facebookresearch / AINA
Official implementation of Dexterity from Smart Lenses Multi-Fingered Robot Manipulation with In-the-Wild Human Demonstrations. Project w…
☆58 · Dec 26, 2025 · Updated 6 months ago
Jiayuan-Gu / motion-planning
A pythonic motion planning library
☆45 · Sep 28, 2024 · Updated last year
JunlinHan / CropMix
Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping
☆17 · Oct 8, 2022 · Updated 3 years ago
ToruOwO / twisting-lids
Twisting Lids Off with Two Hands [CoRL 2024]
☆42 · Mar 16, 2025 · Updated last year
pd-perry / EXPO
☆34 · Aug 25, 2025 · Updated 10 months ago
gary23ai / awesome_concept_learning_list
A curated list of papers & resources linked to concept learning
☆13 · Aug 9, 2023 · Updated 2 years ago
Jazzcharles / OVSegmentor
OVSegmentor, CVPR23
☆62 · Apr 22, 2024 · Updated 2 years ago
elad-amrani / xtra
PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025
☆14 · Nov 21, 2025 · Updated 8 months ago
helblazer811 / RefSAM
Referring Image Segmentation Benchmarking with Segment Anything Model (SAM)
☆39 · Apr 7, 2023 · Updated 3 years ago
StoneT2000 / trajectorytranslation
Code for Abstract-to-Executable Trajectory Translation for One Shot Task Generalization (ICML 2023)
☆23 · May 12, 2023 · Updated 3 years ago
qpsolvers / free_for_all_qpbenchmark
Community-built test set to benchmark QP solvers
☆16 · Updated this week
UT-Austin-RPL / deoxys_vision
Vision package for robot manipulation and learning research
☆26 · Apr 21, 2024 · Updated 2 years ago
facebookresearch / modem
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
☆87 · Dec 12, 2022 · Updated 3 years ago
nicklashansen / puppeteer
Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"
☆213 · Sep 18, 2025 · Updated 10 months ago
epic-kitchens / C1-Action-Recognition
Evaluation metrics and submission file creation scripts for the Action Recognition challenge
☆15 · Feb 9, 2026 · Updated 5 months ago
showlab / MovieSeq
[ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences
☆46 · Mar 11, 2025 · Updated last year
jhejna / hierarchical_morphology_transfer
Code for paper "Hierarchically Decoupled Imitation for Morphological Transfer"
☆17 · Mar 24, 2023 · Updated 3 years ago
SkyworkAI / DAQ-VS
Code for our work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]
☆15 · Jul 11, 2024 · Updated 2 years ago
evelinehong / 3D-Concept-Grounding
Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)"
☆15 · Feb 13, 2023 · Updated 3 years ago
bytedance / fc-clip
[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…
☆345 · Feb 5, 2024 · Updated 2 years ago
syp2ysy / prompt-SelF
[TIP] Exploring Effective Factors for Improving Visual In-Context Learning
☆21 · Jul 2, 2025 · Updated last year
PKU-YuanGroup / LLMBind
LLMBind: A Unified Modality-Task Integration Framework
☆19 · Jun 16, 2024 · Updated 2 years ago