OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning
☆27May 24, 2025Updated 9 months ago
Alternatives and similar repositories for OpenSeg-R
Users who are interested in OpenSeg-R are comparing it to the repositories listed below
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Nov 18, 2025Updated 3 months ago
- ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation☆27May 27, 2025Updated 9 months ago
- ☆14Oct 30, 2023Updated 2 years ago
- [CVPR2025] Official implementation of "One-Way Ticket: Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models"☆28Jul 28, 2025Updated 7 months ago
- Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device☆75Updated this week
- [ICLR 25] The implementation of paper Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision.☆53Jul 17, 2025Updated 7 months ago
- Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining☆30Apr 4, 2022Updated 3 years ago
- ☆31Mar 24, 2022Updated 3 years ago
- [2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding☆31Aug 5, 2023Updated 2 years ago
- This is the official implementation of ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos☆43Nov 5, 2025Updated 3 months ago
- ☆68Nov 5, 2025Updated 3 months ago
- Scalable DBSCAN and OPTICS for clustering high-dimensional datasets using random projections☆13Nov 1, 2024Updated last year
- ☆36Apr 14, 2021Updated 4 years ago
- Official implementation of the paper "LTrack: Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Rep…☆12Jul 26, 2023Updated 2 years ago
- Project focused on enhancing the quality of low-fidelity endoscopy images using Generative Adversarial Networks (GANs) implemented in PyT…☆17Jun 5, 2025Updated 8 months ago
- ☆11Dec 6, 2024Updated last year
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆21Jun 23, 2025Updated 8 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
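- Chinese edition of a comprehensive Python resource list, covering web frameworks, web crawlers, web content extraction, template engines, databases, data visualization, image processing, text processing, natural language processing, machine learning, logging, code analysis, and more☆11May 24, 2016Updated 9 years ago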
- [ICCV2025] Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning☆23Nov 13, 2025Updated 3 months ago
- Communication Relay by creating a WiFi Mesh Network using ROS, and using that network for Data Telemetry, with Telemetry radios ( Ubiquit…☆11Dec 18, 2018Updated 7 years ago
- SAM4SS: Tailoring SAM and SAM2 for Semantic Segmentation☆11Jul 31, 2024Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆14Apr 30, 2025Updated 10 months ago
- Reinforcement Training of Robot☆11Dec 1, 2019Updated 6 years ago
- [MedIA 2026] Official implementation of TTGA: Test-Time Generative Augmentation for Medical Image Segmentation.☆12Jan 5, 2026Updated last month
- ☆13Jul 28, 2024Updated last year
- Chapter-wise notebooks for the book 'Practical Natural Language Processing'☆10Apr 21, 2020Updated 5 years ago
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca.☆34Dec 16, 2025Updated 2 months ago
- UM1 test programs and sample code☆11Jul 25, 2022Updated 3 years ago
- ☆10May 15, 2021Updated 4 years ago
- ☆15Dec 2, 2025Updated 2 months ago
- [ICCV' 23] MRM: Masked Relation Modeling for Medical Image Pre-Training with Genetics☆10Oct 28, 2024Updated last year
- ☆18Sep 20, 2025Updated 5 months ago
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- Converts folders of images to chunks which can easily be saved/loaded into RAM (numpy).☆11Nov 21, 2019Updated 6 years ago
- Computational Neuroscience stuff☆13Aug 12, 2019Updated 6 years ago
- Weakly Supervised Referring Video Object Segmentation with Object-Centric Pseudo-Guidance☆10Aug 17, 2024Updated last year
- Code and dataset release for the paper "Unstructured Evidence Attribution for Long Context Query Focused Summarization"☆11Nov 3, 2025Updated 3 months ago
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆14May 31, 2023Updated 2 years ago