seanzhuh / SeqTRView external linksLinks
SeqTR: A Simple yet Universal Network for Visual Grounding
☆144Oct 30, 2024Updated last year
Alternatives and similar repositories for SeqTR
Users that are interested in SeqTR are comparing it to the libraries listed below
Sorting:
- Replication of Pix2Seq with Pretrained Model☆59Nov 6, 2021Updated 4 years ago
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding☆153Jul 13, 2024Updated last year
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆96Dec 2, 2022Updated 3 years ago
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021☆68May 26, 2022Updated 3 years ago
- ☆19Jan 7, 2026Updated last month
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆85Nov 2, 2022Updated 3 years ago
- ☆195Feb 27, 2024Updated last year
- UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)☆89Jun 12, 2023Updated 2 years ago
- ☆161Jul 19, 2023Updated 2 years ago
- ☆221Apr 13, 2023Updated 2 years ago
- A lightweight codebase for referring expression comprehension and segmentation☆57May 21, 2022Updated 3 years ago
- ☆16Nov 14, 2022Updated 3 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆57Nov 28, 2022Updated 3 years ago
- Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)☆939Nov 7, 2023Updated 2 years ago
- (ECCVW 2025)GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest☆551Jun 3, 2025Updated 8 months ago
- [CVPR2022] Official Implementation of ReferFormer☆352Feb 15, 2025Updated last year
- ☆1,047Oct 3, 2022Updated 3 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020☆89Sep 30, 2021Updated 4 years ago
- ☆87Apr 15, 2022Updated 3 years ago
- [CVPR 2022 Oral] Official implementation of DN-DETR☆602Dec 20, 2023Updated 2 years ago
- Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)☆123Dec 17, 2022Updated 3 years ago
- Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection☆33Apr 18, 2022Updated 3 years ago
- [CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers☆193Sep 24, 2023Updated 2 years ago
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆209Dec 18, 2022Updated 3 years ago
- awesome grounding: A curated list of research papers in visual grounding☆1,125Sep 21, 2025Updated 4 months ago
- [ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension☆15Sep 4, 2022Updated 3 years ago
- An official implementation for MS-DETR in ACL'23☆17Jun 3, 2023Updated 2 years ago
- PyTorch Implementation of Sparse DETR☆175Jan 3, 2024Updated 2 years ago
- A full-fledged version of Pix2Seq☆238Nov 6, 2021Updated 4 years ago
- ☆41Sep 21, 2023Updated 2 years ago
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Feb 15, 2023Updated 3 years ago
- Grounded Language-Image Pre-training☆2,573Jan 24, 2024Updated 2 years ago
- A collection of papers about Referring Image Segmentation.☆808Jan 28, 2026Updated 2 weeks ago
- [ECCV2022] This is an official implementation of paper "RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentati…☆78Feb 12, 2023Updated 3 years ago
- ☆61Oct 23, 2021Updated 4 years ago
- [CVPR 2023] Exploring High-Quality Pseudo Masks for Weakly Supervised Instance Segmentation☆80Apr 4, 2023Updated 2 years ago
- A new video text spotting framework with Transformer☆78May 23, 2022Updated 3 years ago
- An official PyTorch implementation of the CRIS paper☆280Jun 9, 2024Updated last year
- [NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation☆48Jun 29, 2023Updated 2 years ago