heliossun/SQ-LLaVA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/heliossun/SQ-LLaVA)

heliossun / SQ-LLaVA

Visual self-questioning for large vision-language assistant.

☆44

Alternatives and similar repositories for SQ-LLaVA

Users that are interested in SQ-LLaVA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

heliossun / STLLaVA-Med
View on GitHub
Self-training LLaVA for medical
☆16Nov 3, 2024Updated last year
yinyueqin / DenseRewardRLHF-PPO
View on GitHub
This repository contains the code and released models for the paper Segmenting Text and Learning Their Rewards for Improved RLHF in Langu…
☆19Jan 8, 2025Updated last year
KD-TAO / VidKV
View on GitHub
VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models
☆26Mar 26, 2025Updated last year
pumpkin805 / FALIP
View on GitHub
[ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
☆18Sep 11, 2024Updated last year
PhoenixZ810 / MG-LLaVA
View on GitHub
Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning(https://arxiv.org/abs/2406.17770).
☆160Sep 27, 2024Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
salesforce / GlueGen
View on GitHub
☆65Jun 16, 2025Updated last year
wanglichenxj / Dual-Relation-Semi-supervised-Multi-label-Learning
View on GitHub
☆23Sep 3, 2020Updated 5 years ago
callsys / ControlCap
View on GitHub
[ECCV 2024] ControlCap: Controllable Region-level Captioning
☆81Oct 25, 2024Updated last year
YuxiXie / V-DPO
View on GitHub
Preference Learning for LLaVA
☆60Nov 9, 2024Updated last year
codezakh / LilT
View on GitHub
[ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning
☆40Jul 29, 2023Updated 2 years ago
Jazzcharles / Egoinstructor
View on GitHub
Pytorch implementation for Egoinstructor at CVPR 2024
☆28Dec 1, 2024Updated last year
shiqichen17 / AdaptVis
View on GitHub
Github repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025)
☆76May 2, 2025Updated last year
IIGROUP / SCL
View on GitHub
Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning
☆20Dec 21, 2023Updated 2 years ago
canqin001 / Efficient_Graph_Similarity_Computation
View on GitHub
[NeurIPS-2021] Slow Learning and Fast Inference: Efficient Graph Similarity Computation via Knowledge Distillation
☆43Mar 24, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
NorthSummer / LGKD
View on GitHub
BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection
☆13Apr 2, 2024Updated 2 years ago
yihuaihong / Dissecting-FT-Unlearning
View on GitHub
[EMNLP 2024 Main] Code for the paper "Dissecting Fine-Tuning Unlearning in Large Language Models"
☆14Oct 10, 2024Updated last year
WeiZhang1988 / BEVFormerReimplementation
View on GitHub
☆16May 14, 2024Updated 2 years ago
MingSun-Tse / smilelogging
View on GitHub
Python logging package for easy reproducible experimenting in research
☆41Jul 29, 2025Updated 11 months ago
Hoyyyaard / NavGPT
View on GitHub
☆10Nov 16, 2023Updated 2 years ago
William-wAng618 / M2PT
View on GitHub
Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning
☆29Mar 23, 2025Updated last year
lezhang7 / Retrieval_MuGI
View on GitHub
[EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da…
☆14Mar 28, 2025Updated last year
visionjo / facerec-bias-bfw
View on GitHub
Source code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).
☆52Jun 9, 2026Updated last month
MingSun-Tse / Regularization-Pruning
View on GitHub
[ICLR'21] Neural Pruning via Growing Regularization (PyTorch)
☆82Jul 15, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Buzz-Beater / EgoTaskQA
View on GitHub
Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.
☆44Apr 17, 2023Updated 3 years ago
HUANGLIZI / MMFundus
View on GitHub
This repository is the official data collection of MMFundus (Multimodal Fundus) dataset.
☆13Feb 2, 2026Updated 5 months ago
kangluoyao / VAP_Former
View on GitHub
[MICCAI-2023]Visual-Attribute Prompt Learning for Progressive Mild Cognitive Impairment Prediction
☆15Dec 12, 2023Updated 2 years ago
WisconsinAIVision / ViP-LLaVA
View on GitHub
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
☆338Jul 17, 2024Updated 2 years ago
ayiyayi / EgoExoBench
View on GitHub
☆15Nov 13, 2025Updated 8 months ago
WisconsinAIVision / YoLLaVA
View on GitHub
🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant (NeurIPS 2024)
☆123Mar 26, 2025Updated last year
iancovert / locality-alignment
View on GitHub
☆55Jan 17, 2025Updated last year
jcwang0602 / PLVL
View on GitHub
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
☆13May 9, 2025Updated last year
sugar-fly / VSFormer
View on GitHub
[AAAI 2024] VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning
☆16Apr 7, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Jiaxing-star / LLaVA-Octopus
View on GitHub
☆11Jan 8, 2025Updated last year
ivonajdenkoska / tulip
View on GitHub
[ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP"
☆32Jan 26, 2026Updated 5 months ago
HKUST-LongGroup / RECODE
View on GitHub
[NeurIPS 2023] Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models
☆23Oct 21, 2025Updated 9 months ago
rui-qian / UGround
View on GitHub
Rui Qian, Xin Yin, Chuanhang Deng, et al.: UGround: Towards Unified Visual Grounding with Unrolled Transformers (ICML 2026)
☆29Jun 18, 2026Updated last month
valeoai / MOCA
View on GitHub
MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments
☆13Jul 8, 2024Updated 2 years ago
liyongqi67 / GRACE
View on GitHub
☆29Aug 25, 2024Updated last year
hekj / FDA
View on GitHub
Official Implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS2023)
☆14Jan 8, 2024Updated 2 years ago