CASIA-IVA-Lab/SC-Tune

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CASIA-IVA-Lab/SC-Tune)

CASIA-IVA-Lab / SC-Tune

Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"

☆16

Alternatives and similar repositories for SC-Tune

Users that are interested in SC-Tune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CASIA-IVA-Lab / OPT_Questioner
View on GitHub
Official PyTorch implementation of the paper "Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner"
☆15Aug 9, 2023Updated 2 years ago
Rubics-Xuan / Med-DANet
View on GitHub
Med-DANet Series (ECCV 2022 & WACV 2024)
☆13Jan 2, 2024Updated 2 years ago
liuting20 / DARA
View on GitHub
[ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding
☆22Feb 26, 2025Updated last year
zechao-li / SVF-few-shot-segmentation
View on GitHub
☆22May 16, 2023Updated 3 years ago
CASIA-IVA-Lab / MRES
View on GitHub
This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…
☆74Jun 3, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CASIA-IVA-Lab / VRoPE
View on GitHub
[EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models.
☆28Nov 18, 2025Updated 8 months ago
Show-han / Zeroshot_REC
View on GitHub
Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)
☆28Jun 21, 2024Updated 2 years ago
xulingjing88 / WSMA
View on GitHub
[AAAI 2024]Weakly Supervised Multimodal Affordance Grounding for Egocentric Images
☆13Nov 10, 2024Updated last year
mrflogs / ICLR24
View on GitHub
Official code for ICLR 2024 paper, "A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation"
☆86Apr 21, 2024Updated 2 years ago
ZzZZCHS / WS-3DVG
View on GitHub
[ICCV 2023] Distilling Coarse-to-fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
☆14Oct 2, 2024Updated last year
mrflogs / LoRA-Pre
View on GitHub
Official code for ICLR 2026 Oral paper, "Taming Momentum: Rethinking Optimizer States Through Low-Rank Approximation"
☆33Feb 26, 2026Updated 4 months ago
irvingzhang0512 / open-images-downloader
View on GitHub
☆14Aug 13, 2021Updated 4 years ago
cyzus / thoughtsculpt
View on GitHub
THOUGHTSCULPT, a general reasoning and search method for complex tasks
☆13Dec 13, 2024Updated last year
baaivision / DIVA
View on GitHub
[ICLR 2025] Diffusion Feedback Helps CLIP See Better
☆301Jan 23, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Exgc / R1V-Free
View on GitHub
R1V, trained with AI feedback, answers open-ended visual questions.
☆14Apr 12, 2025Updated last year
Samyu0304 / thought-propagation
View on GitHub
Code and dataset for the ICLR 2024 paper "Thought Propagation: An analogical Approach to Complex Reasoning with Large Language Models."
☆16Mar 4, 2024Updated 2 years ago
AmingWu / SVD-Dictionary-Enhancement
View on GitHub
☆13Dec 4, 2021Updated 4 years ago
ybwang119 / label_recovery
View on GitHub
[ICLR 2024] Towards Elminating Hard Label Constraints in Gradient Inverision Attacks
☆14Feb 6, 2024Updated 2 years ago
cutz-j / T-GD
View on GitHub
T-GD: Transferable GAN-generated Images Detection Framework. (ICML 2020)
☆18May 12, 2021Updated 5 years ago
leofansq / Tools_make_planes
View on GitHub
AVOD needs the planes file to provide ground plane information, but the official planes generation tool has not yet been provided, which …
☆13Apr 23, 2019Updated 7 years ago
Rubics-Xuan / IVG
View on GitHub
This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with H…
☆15May 21, 2024Updated 2 years ago
zjh31 / CPL
View on GitHub
☆21Apr 2, 2024Updated 2 years ago
tim-learn / UEO
View on GitHub
ICML-2024 highlight paper "Realistic Unsupervised CLIP Fine-tuning with Universal Entropy Optimization"
☆19Jul 18, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mrflogs / CraFT
View on GitHub
Official code for ICML 2024 paper, "Connecting the Dots: Collaborative Fine-tuning for Black-Box Vision-Language Models"
☆19Jun 12, 2024Updated 2 years ago
uvavision / SelfEQ
View on GitHub
[CVPR 2024] Code for "Improved Visual Grounding through Self-Consistent Explanations".
☆28Mar 1, 2024Updated 2 years ago
Leminhbinh0209 / AAAI22-ADD
View on GitHub
Official implementation of AAAI22 paper "ADD: Frequency Attention and Multi-View based Knowledge Distillation to Detect Low-Quality Compr…
☆10Mar 1, 2024Updated 2 years ago
mrflogs / SHIP
View on GitHub
Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts"
☆104Mar 6, 2024Updated 2 years ago
rycolab / odpo
View on GitHub
This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).
☆21Feb 17, 2025Updated last year
LivXue / VCIN
View on GitHub
Authors's code for "Variational Causal Inference Network for Explanatory Visual Question Answering" and "Integrating Neural-Symbolic Reas…
☆13Apr 13, 2026Updated 3 months ago
Jasper-Yan / TRACE-RPS
View on GitHub
[ICLR'26] Official Repository for The Paper: Stop Tracking Me! Proactive Defense Against Attribute Inference Attack in LLMs
☆19Apr 6, 2026Updated 3 months ago
PeterGriffinJin / Heterformer
View on GitHub
Heterformer: Transformer-based Deep Node Representation Learning on Heterogeneous Text-Rich Networks (KDD 2023)
☆28Feb 16, 2024Updated 2 years ago
microsoft / VisionAsAdaptations
View on GitHub
☆16May 11, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CASIA-IVA-Lab / VALOR
View on GitHub
[TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
☆311Dec 25, 2024Updated last year
pkunlp-icler / MIC
View on GitHub
MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU
☆49Jul 13, 2025Updated last year
I2-Multimedia-Lab / Magnet
View on GitHub
Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Func…
☆31Dec 2, 2024Updated last year
toggle1995 / RIS-DMMI
View on GitHub
☆47Oct 3, 2023Updated 2 years ago
kingthreestones / RefCLIP
View on GitHub
☆39Jun 28, 2023Updated 3 years ago
rll-research / rune
View on GitHub
Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
☆15May 26, 2022Updated 4 years ago
typoverflow / WiseRL
View on GitHub
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21Mar 24, 2025Updated last year