Visual self-questioning for large vision-language assistant.
☆44Jul 23, 2025Updated 10 months ago
Alternatives and similar repositories for SQ-LLaVA
Users that are interested in SQ-LLaVA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Self-training LLaVA for medical☆16Nov 3, 2024Updated last year
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models☆26Mar 26, 2025Updated last year
- Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning(https://arxiv.org/abs/2406.17770).☆159Sep 27, 2024Updated last year
- [ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆18Sep 11, 2024Updated last year
- Rui Qian, Xin Yin, Chuanhang Deng, et al.: UGround: Towards Unified Visual Grounding with Unrolled Transformers (ICML 2026)☆25May 8, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆65Jun 16, 2025Updated 11 months ago
- ☆23Sep 3, 2020Updated 5 years ago
- 中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.☆15Aug 25, 2024Updated last year
- Pytorch implementation for Egoinstructor at CVPR 2024☆28Dec 1, 2024Updated last year
- [ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning☆40Jul 29, 2023Updated 2 years ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆81Oct 25, 2024Updated last year
- Explainable Face Recognition ECCV 2020 Paper code and dataset repository☆65Apr 28, 2021Updated 5 years ago
- Official Repository for CVPR 2024 Paper: "Large Language Models are Good Prompt Learners for Low-Shot Image Classification"☆44Jul 1, 2024Updated last year
- Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning☆20Dec 21, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆15Feb 27, 2025Updated last year
- BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection☆13Apr 2, 2024Updated 2 years ago
- [NeurIPS-2021] Slow Learning and Fast Inference: Efficient Graph Similarity Computation via Knowledge Distillation☆43Mar 24, 2023Updated 3 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Feb 8, 2025Updated last year
- [CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models☆111Nov 22, 2025Updated 6 months ago
- Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning☆28Mar 23, 2025Updated last year
- HD-EPIC Python script to download the entire datasets or parts of it☆21Oct 7, 2025Updated 8 months ago
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025☆31Apr 8, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Python logging package for easy reproducible experimenting in research☆41Jul 29, 2025Updated 10 months ago
- Source code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).☆52May 6, 2025Updated last year
- Official code for "Can We Talk Models Into Seeing the World Differently?" (ICLR 2025).☆30Jan 26, 2025Updated last year
- [ICLR'21] Neural Pruning via Growing Regularization (PyTorch)☆82Jul 15, 2021Updated 4 years ago
- [ICLR2025 Spotlight] Advantage-Guided Distillation for Preference Alignment in Small Language Models☆26Feb 10, 2025Updated last year
- 启智平台(qz.sii.edu.cn)的 Agent 驾驶舱:Skill + CLI,一条命令直达。Agent cockpit for the Inspire ML platform — one command, every operation, straight from…☆125Updated this week
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention☆68Jul 16, 2024Updated last year
- One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models☆59Apr 25, 2026Updated last month
- This repository is the official data collection of MMFundus (Multimodal Fundus) dataset.☆13Feb 2, 2026Updated 4 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [MICCAI-2023]Visual-Attribute Prompt Learning for Progressive Mild Cognitive Impairment Prediction☆15Dec 12, 2023Updated 2 years ago
- [EMNLP 2024 Main] Code for the paper "Dissecting Fine-Tuning Unlearning in Large Language Models"☆14Oct 10, 2024Updated last year
- Progressive Language-guided Visual Learning for Multi-Task Visual Grounding☆13May 9, 2025Updated last year
- ☆120Jun 2, 2026Updated last week
- [CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts☆339Jul 17, 2024Updated last year
- [AAAI 2024] VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning☆16Apr 7, 2024Updated 2 years ago
- ☆54Jan 17, 2025Updated last year