[ICLR2025] Code Release of Refining CLIP's Spatial Awareness: A Visual-centric Perspective
☆20Apr 11, 2025Updated last year
Alternatives and similar repositories for CLIPRefiner
Users that are interested in CLIPRefiner are comparing it to the libraries listed below.
- Official implementation of the work "Coherent and Multi-modality Image Inpainting via Latent Space Optimization"☆55Apr 10, 2025Updated last year
- [NeurIPS'25] FlySearch: Exploring how vision-language models explore☆24Mar 12, 2026Updated last month
- Flow RL is a high-performance RL library with flow and diffusion models.☆36Apr 23, 2026Updated last week
- Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"…☆20Nov 11, 2025Updated 5 months ago
- The project presents a drone obstacle avoidance system using Microsoft AirSim and the DDPG algorithm, training drones with LIDAR and dept…☆22May 22, 2024Updated last year
- ☆12May 26, 2022Updated 3 years ago
- A curated publication list on visual dialog☆14May 8, 2023Updated 2 years ago
- The goal of this project is to make a prediction model which will predict whether an athlete will win a medal or not.☆10Sep 17, 2021Updated 4 years ago
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction☆53Sep 22, 2025Updated 7 months ago
- Code for "Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding"☆68Apr 6, 2026Updated 3 weeks ago
- A minimalist (educational) implementation of Latent Diffusion Models (LDM) with PyTorch distributed training.☆13Dec 22, 2024Updated last year
- ☆15Feb 23, 2023Updated 3 years ago
- ☆29Jan 27, 2025Updated last year
- Supervised Training of Conditional Monge Maps☆19Oct 30, 2023Updated 2 years ago
- Fast-Slow Test-time Adaptation for Online Vision-and-Language Navigation☆34Dec 5, 2025Updated 4 months ago
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆39Dec 13, 2025Updated 4 months ago
- A torch-based implementation of K-Means and K-Means++☆17Dec 6, 2020Updated 5 years ago
- [ICRA 2026] Official implementation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning"☆48Feb 2, 2026Updated 3 months ago
- EXT2 File System Emulator☆17Jun 27, 2020Updated 5 years ago
- ☆45Nov 13, 2025Updated 5 months ago
- ☆73May 5, 2025Updated 11 months ago
- [CVPR 2026] Official implementation of "ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models"☆148Apr 3, 2026Updated last month
- [CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction"☆70Jul 2, 2025Updated 10 months ago
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization☆55Sep 4, 2023Updated 2 years ago
- [CVPR 2026] Thinking in 360°: Humanoid Visual Search in the Wild☆143Mar 3, 2026Updated 2 months ago
- An example RLDS dataset builder for X-embodiment dataset conversion.☆62Mar 1, 2025Updated last year
- ☆53Jan 3, 2023Updated 3 years ago
- Official pytorch implementation of "AlphaFlow: Understanding and Improving MeanFlow Models"☆118Oct 24, 2025Updated 6 months ago
- ☆38Apr 16, 2025Updated last year
- This repository is dedicated to collecting and sharing research papers on diffusion guidance methods.☆69Apr 11, 2026Updated 3 weeks ago
- repository containing analysis scripts and auxiliary files☆38Apr 9, 2020Updated 6 years ago
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆76Dec 26, 2025Updated 4 months ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆55Nov 25, 2024Updated last year
- EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026]☆142Apr 11, 2026Updated 3 weeks ago
- Mini-Kinetics-200 data splits used in paper "Rethinking Spatiotemporal Feature Learning For Video Understanding"☆80Dec 24, 2017Updated 8 years ago
- Official implementation of ImageCritic (CVPR 2026)☆158Mar 7, 2026Updated last month
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆60Nov 8, 2024Updated last year
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆82Jun 11, 2024Updated last year
- [CVPR'25] Towards More General Video-based Deepfake Detection through Facial Feature Guided Adaptation for Foundation Model (DFD-FCG)☆52Jul 20, 2025Updated 9 months ago