DanielSHKao/ThinkFirst

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DanielSHKao/ThinkFirst)

DanielSHKao / ThinkFirst

Official implementation for "Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts"

☆22

Alternatives and similar repositories for ThinkFirst

Users that are interested in ThinkFirst are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DanielSHKao / CoT-RVS
View on GitHub
[ICLR 2026] Official implementation for CoT-RVS
☆23Mar 17, 2026Updated 4 months ago
baoxiaoyi / CoReS
View on GitHub
code for the paper "CoReS: Orchestrating the Dance of Reasoning and Segmentation"
☆23Nov 24, 2025Updated 7 months ago
tommarvoloriddle / SLIP
View on GitHub
ZERO SHOT CONTEXT-BASED OBJECT SEGMENTATION USING SLIP (SAM+CLIP)
☆19May 23, 2024Updated 2 years ago
ZHANG1023 / FLNeRF
View on GitHub
☆18Apr 30, 2023Updated 3 years ago
Zhang-Yihao / Transfomer2DFA
View on GitHub
Implementation for paper Automata Extraction from Transformers.
☆12Jun 8, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
songw-zju / PixelThink
View on GitHub
The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (ICML 2026)
☆43Jul 4, 2026Updated 2 weeks ago
jcwang0602 / MLLMSeg
View on GitHub
MLLMSeg: Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder
☆56Jun 12, 2026Updated last month
yayafengzi / LMM-HiMTok
View on GitHub
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model
☆97Jul 17, 2025Updated last year
mahtabbigverdi / Aurora
View on GitHub
☆12Dec 4, 2024Updated last year
aim-uofa / Omni-R1
View on GitHub
[NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
☆126Dec 3, 2025Updated 7 months ago
Junyi42 / viser
View on GitHub
☆16Feb 21, 2025Updated last year
MuxLi / Approximating-shapes-in-images-with-low-complexity-polygons
View on GitHub
☆10Aug 14, 2020Updated 5 years ago
zhangce01 / SimNL
View on GitHub
[WACV 2025] Code for Enhancing Vision-Language Few-Shot Adaptation with Negative Learning
☆12Feb 24, 2025Updated last year
Luo-Z13 / GLH-Bridge-Code
View on GitHub
☆13Nov 21, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
unica-visual-intelligence-lab / OmniRad
View on GitHub
☆16Feb 3, 2026Updated 5 months ago
yaoliliu / OmniRefiner
View on GitHub
OmniRefiner: Reinforcement-Guided Local Diffusion Refinement
☆18Nov 26, 2025Updated 7 months ago
TungChintao / SkiLa
View on GitHub
Official codes of "Sketch-in-Latents: Eliciting Unified Reasoning in MLLMs"
☆17Feb 15, 2026Updated 5 months ago
Bodhiswatta / ICTNet
View on GitHub
ICTNet: a novel network for semantic segmentation with the underlying architecture of a fully convolutional network, infused with feature…
☆10May 27, 2020Updated 6 years ago
aim-uofa / SegAgent
View on GitHub
[CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
☆106Aug 8, 2025Updated 11 months ago
DylanOrange / flexevent
View on GitHub
FlexEvent: Event Camera Object Detection at Arbitrary Frequencies
☆21Dec 10, 2024Updated last year
lionelmessi6410 / Panorama-Stitching
View on GitHub
Create a panorama stitching image based on multiple images.
☆12Jul 18, 2019Updated 7 years ago
opencity3d / opencity3d
View on GitHub
Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025
☆19Nov 24, 2024Updated last year
ViolinLee / TelloDroneDetectionPython
View on GitHub
使用Tello无人机进行真假IKUN辨别
☆19Apr 14, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yinghemedical / U-VLM
View on GitHub
U-VLM: Hierarchical Vision Language Modeling for Report Generation
☆18Apr 30, 2026Updated 2 months ago
gyhdog99 / RACRO2
View on GitHub
Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)
☆19Jul 1, 2025Updated last year
TMIU / iTFA
View on GitHub
Incremental Few-Shot Object Detection via Simple Fine-Tuning Approach (ICRA 2023)
☆10Feb 14, 2023Updated 3 years ago
EchoSafe-MLLM / EchoSafe
View on GitHub
[CVPR 2026] Code for Evolving Contextual Safety in Multi-Modal Large Language Models via Inference-Time Self-Reflective Memory
☆15Mar 18, 2026Updated 4 months ago
MasterVito / DAC-RL
View on GitHub
Official Repo for DAC-RL: Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability
☆16Feb 26, 2026Updated 4 months ago
porterjenkins / region-encoder
View on GitHub
Repository for the paper "Unsupervised Representation Learning of Spatial Data via Multimodal Embedding"
☆12Dec 5, 2019Updated 6 years ago
XiaoyuXU1 / Representational_Analysis_Tools
View on GitHub
☆15May 23, 2025Updated last year
mll-lab-nu / ViewAgent
View on GitHub
☆20Jul 3, 2026Updated 2 weeks ago
axin1301 / satellite-imagery-POI
View on GitHub
☆11Feb 5, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Zizzzzzzz / SFNet_T-ITS24
View on GitHub
☆19Feb 2, 2026Updated 5 months ago
HUuxiaobin / DiffuMatting
View on GitHub
☆18Jul 14, 2025Updated last year
sayanmndl / SAM2LoRA
View on GitHub
LoRA implementation of Segment Anything 2 Model
☆15Mar 14, 2026Updated 4 months ago
AMAP-ML / UniVG-R1
View on GitHub
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning
☆165Jun 2, 2025Updated last year
ChengyinLee / MulModSeg_2024
View on GitHub
Enhancing Unpaired Multi-Modal Medical Image Segmentation with Modality-Conditioned Text Embedding and Alternating Training
☆23Jan 2, 2025Updated last year
Red-Fairy / argus-code
View on GitHub
[ICCV 2025] Official repository of the paper "Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos"
☆45Feb 2, 2026Updated 5 months ago
GATECH-EIC / Castling-ViT
View on GitHub
[CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference
☆31Mar 14, 2024Updated 2 years ago