[TMLR 2025 J2C] TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models
☆52Dec 24, 2025Updated 2 months ago
Alternatives and similar repositories for TextRegion
Users that are interested in TextRegion are comparing it to the libraries listed below
- [ICML'25] Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models☆21Sep 7, 2025Updated 6 months ago
- Simulation assets of space-ros demos☆17Jan 31, 2026Updated last month
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation☆115Nov 22, 2025Updated 4 months ago
- ☆42May 15, 2025Updated 10 months ago
- 3D LiDAR Processing Tools☆15Jun 28, 2022Updated 3 years ago
- Official implementation of DIP: Unsupervised Dense In-Context Post-training of Visual Representations☆46Sep 8, 2025Updated 6 months ago
- PyTorch Implementation for InMaP☆11Oct 28, 2023Updated 2 years ago
- A Python library for inference-time scaling LLMs☆32Updated this week
- ☆19Jul 4, 2024Updated last year
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 3 months ago
- Implementation of "Quadrotor Helicopter Trajectory Tracking Control"☆14Jan 11, 2021Updated 5 years ago
- ☆25Dec 8, 2025Updated 3 months ago
- [ICLR 2024] The official implementation of Zip-Your-Clip☆35Mar 14, 2024Updated 2 years ago
- Documentation for terminator☆16Jul 8, 2025Updated 8 months ago
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆12Mar 19, 2024Updated 2 years ago
- Source code accompanying the NeurIPS 2022 paper "Learning Partial Equivariances From Data"☆10Nov 18, 2022Updated 3 years ago
- Margin-based Vision Transformer☆67Nov 28, 2025Updated 3 months ago
- [CVPR 2026] Official repo of "MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing"☆81Mar 5, 2026Updated 2 weeks ago
- PyTorch Implementation of NACLIP in "Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation"☆74Sep 23, 2024Updated last year
- Official repository for the General Robust Image Task (GRIT) Benchmark☆54Mar 29, 2023Updated 2 years ago
- [NeurIPS '25 Spotlight] Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers"☆173Sep 19, 2025Updated 6 months ago
- ☆12Dec 11, 2024Updated last year
- Implementation for DIY-SC paper.☆23Jul 14, 2025Updated 8 months ago
- Reinforcing Action Policies by Prophesying☆40Nov 26, 2025Updated 3 months ago
- ☆37Nov 13, 2025Updated 4 months ago
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆87Sep 8, 2025Updated 6 months ago
- [ECCV 2024] Official code repository of paper titled "Efficient 3D-Aware Facial Image Editing Via Attribute-Specific Prompt Learning"☆10Aug 2, 2024Updated last year
- Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation☆25Sep 20, 2025Updated 6 months ago
- CLIP-MoE: Mixture of Experts for CLIP☆56Oct 10, 2024Updated last year
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆20Sep 24, 2025Updated 5 months ago
- ☆44Jun 25, 2025Updated 8 months ago
- ☆18Apr 10, 2025Updated 11 months ago
- FormulaOne: A dataset of algorithmic problems based on MSO formulas.☆25Mar 1, 2026Updated 2 weeks ago
- Implementation of Variance Reduction Techniques in Julia☆11Sep 6, 2016Updated 9 years ago
- TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Dual-Level Scale-Oriented Contrast☆20Mar 3, 2026Updated 2 weeks ago
- [CVPR 2025] This repository is the official implementation of "ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Langua…☆21Apr 1, 2025Updated 11 months ago
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 3 months ago
- [CVPR 2025] Official Pytorch implementation of "Learning with Noisy Triplet Correspondence for Composed Image Retrieval".☆23Jun 9, 2025Updated 9 months ago
- SwiftUI Drag and Drop editor☆15Sep 20, 2020Updated 5 years ago