wysnzzzz/DIT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wysnzzzz/DIT)

wysnzzzz / DIT

☆18

Alternatives and similar repositories for DIT

Users that are interested in DIT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zlai0 / S-Seg
View on GitHub
☆23Jan 24, 2024Updated 2 years ago
GeWu-Lab / Stepping-Stones
View on GitHub
The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
☆18Oct 11, 2024Updated last year
lxq-jnu / SpTFuse
View on GitHub
☆16Jan 21, 2025Updated last year
ywh187 / FitPrune
View on GitHub
☆68Jan 23, 2026Updated 6 months ago
LinfengYuan1997 / LoSh
View on GitHub
[CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
☆13Jun 17, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
nini0919 / SemiRES
View on GitHub
[ICML2024]The official implementation of SemiRES in PyTorch.
☆33Jun 20, 2024Updated 2 years ago
fawnliu / TRIS
View on GitHub
[ICCV 2023] Official code release of our paper "Referring Image Segmentation Using Text Supervision"
☆75Oct 13, 2024Updated last year
rongfu-dsb / MPG-SAM2
View on GitHub
[ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
☆23Sep 5, 2025Updated 10 months ago
jasongief / TGS-Agent
View on GitHub
[2026 AAAI] Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation
☆20Nov 8, 2025Updated 8 months ago
zdk258 / CorrCLIP
View on GitHub
[ICCV 2025 Oral] CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation
☆70Aug 1, 2025Updated 11 months ago
JiHooooo / LGA
View on GitHub
☆13Jul 6, 2024Updated 2 years ago
yannqi / COMBO-AVS
View on GitHub
[CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…
☆40Apr 20, 2025Updated last year
Vibashan / PosSAM
View on GitHub
Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything
☆71Apr 7, 2024Updated 2 years ago
wds1998 / Edge-LBAM
View on GitHub
Pytorch implementation of paper "Image Inpainting with Edge-guided Learnable Bidirectional Attention Maps"
☆25Jul 24, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ziplab / MPVSS
View on GitHub
☆33Feb 29, 2024Updated 2 years ago
Lingyun0419 / CVPT
View on GitHub
Cross Visual Prompt Tuning [ICCV 2025]
☆13Aug 3, 2025Updated 11 months ago
yyliu01 / AuralSAM2
View on GitHub
[CVPR'26, Findings] AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting
☆15May 18, 2026Updated 2 months ago
SkyworkAI / DAQ-VS
View on GitHub
Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]
☆15Jul 11, 2024Updated 2 years ago
Tapall-AI / MeViS_Track_Solution_2024
View on GitHub
[CVPR 2024 Challenge] 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation
☆31Oct 18, 2024Updated last year
Huiqin-Zhang / SPGFusion
View on GitHub
This is official Pytorch implementation of SPGFusion
☆22Sep 2, 2025Updated 10 months ago
clearxu / SPT
View on GitHub
Code of ["Spectral Prompt Tuning: Unveiling Unseen Classes for Zero-Shot Semantic Segmentation"]
☆14Apr 26, 2024Updated 2 years ago
ManuelPalermo / AndroidVideoSegmentation
View on GitHub
Android video semantic segmentation using DeeplabV3+ lite
☆10Sep 20, 2019Updated 6 years ago
ruohaoguo / ovavss
View on GitHub
Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].
☆37Nov 2, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
sosppxo / RG-SAN
View on GitHub
[NeurIPS 2024 Oral] RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation
☆20Dec 22, 2024Updated last year
LiamLian0727 / Awesome_Underwater_Datasets
View on GitHub
Pointers to large-scale underwater datasets and relevant resources.
☆11May 22, 2025Updated last year
YuHengsss / Trident
View on GitHub
[ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation
☆126Nov 22, 2025Updated 8 months ago
GeWu-Lab / Ref-AVS
View on GitHub
The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024
☆50Oct 12, 2025Updated 9 months ago
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
PKU-ICST-MIPL / TARA_CVPR2026
View on GitHub
☆17Mar 21, 2026Updated 4 months ago
PotatoTian / recall-semseg
View on GitHub
☆45Feb 4, 2022Updated 4 years ago
letitiabanana / PnP-OVSS
View on GitHub
[CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
☆18Jul 22, 2024Updated 2 years ago
Caoyichao / UniHOI
View on GitHub
Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…
☆28Nov 8, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
visinf / veto
View on GitHub
Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023)
☆22Mar 23, 2026Updated 4 months ago
Leon1207 / 3DRefTR
View on GitHub
This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding"
☆26Aug 24, 2023Updated 2 years ago
ZjjConan / VLM-LwEIB
View on GitHub
The official pytorch implemention of our IJCV-2025 paper "Learning with Enriched Inductive Biases for Vision-Language Models".
☆15Jul 6, 2026Updated 2 weeks ago
showlab / Tune-An-Ellipse
View on GitHub
[CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want
☆14Jan 5, 2025Updated last year
wang-x-1997 / DAFusion
View on GitHub
[INF FUS 2025] "A Degradation-Aware Guided Fusion Network for Infrared and Visible Image"
☆24Mar 13, 2025Updated last year
bjzhb666 / GS-LoRA
View on GitHub
Practical Continual Forgetting for Pre-trained Vision Models (CVPR 2024; T-PAMI 2026)
☆75Jan 15, 2026Updated 6 months ago
DoubtedSteam / Flash_Attn_with_Score
View on GitHub
Flash Attention implementation that returns both output and attention scores. High-performance, memory-efficient attention with score ext…
☆16Feb 6, 2026Updated 5 months ago