[NeurIPS'24] Official implementation of paper "Unveiling the Tapestry of Consistency in Large Vision-Language Models".
☆38Oct 23, 2024Updated last year
Alternatives and similar repositories for ConBench
Users that are interested in ConBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACM MM'23] Official implementation of paper "Avatar Knowledge Distillation: Self-ensemble Teacher Paradigm with Uncertainty".☆14Nov 22, 2023Updated 2 years ago
- [NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment☆58Sep 26, 2024Updated last year
- [ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" and "Sp…☆257Dec 22, 2025Updated 3 months ago
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.☆71Apr 14, 2023Updated 2 years ago
- YOLOv3-TensorRT-INT8-KCF is a TensorRT Int8-Quantization implementation of YOLOv3 (and YOLOv3-tiny) on NVIDIA Jetson Xavier NX Board. The…☆15Dec 20, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Pytorch implementation of RAPQ, IJCAI 2022☆23Jul 19, 2023Updated 2 years ago
- ☆81Jul 27, 2024Updated last year
- 🔥 [AAAI 2026 Oral] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptat…☆76Jul 8, 2024Updated last year
- [AAAI 2026] Official code for MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Man…☆66Jul 31, 2025Updated 8 months ago
- [TMC 2025/NOSSDAV 2023] Official code for RepCaM++ and RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery☆54Apr 21, 2025Updated 11 months ago
- A Practical Zoom-in GUI Grounding and Behavior-Based Evaluation method.☆21Dec 8, 2025Updated 4 months ago
- [CVPR 2025] Official implementation of paper "MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders".☆51Jun 7, 2025Updated 10 months ago
- [Arxiv 2025] Official code for T-REX: Mixture-of-Rank-One-Experts with semantic-aware Intuition for Multi-task Large Language Model Finet…☆17May 16, 2025Updated 10 months ago
- Official codebase for “In-Context Learning with Many Demonstration Examples”☆16Feb 13, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model☆47Nov 10, 2024Updated last year
- ☆12Mar 12, 2022Updated 4 years ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆26Jan 14, 2025Updated last year
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆95Jan 24, 2024Updated 2 years ago
- VisualGPTScore for visio-linguistic reasoning☆27Oct 7, 2023Updated 2 years ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Sep 16, 2023Updated 2 years ago
- Implementation of "Single Shot Video Object Detector"☆23Mar 25, 2020Updated 6 years ago
- ☆17Oct 1, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- Official code implementation for the paper "Active Finetuning: Exploiting Annotation Budget in the Pretraining-Finetuning Paradigm" (CVPR…☆57Apr 16, 2025Updated 11 months ago
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆146Jun 20, 2024Updated last year
- Benchmarks for Macro Neural Architecture Search; used and described in the paper "Local Search is a Remarkably Strong Baseline for Neural…☆12Jul 25, 2024Updated last year
- [COLING2022] A Multi-turn Machine Reading Comprehension Framework with Rethink Mechanism for Emotion-Cause Pair Extraction☆18Oct 13, 2022Updated 3 years ago
- ☆24Jun 11, 2025Updated 10 months ago
- The official implementation of 《MLLMs-Augmented Visual-Language Representation Learning》☆31Mar 12, 2024Updated 2 years ago
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆181Jun 20, 2025Updated 9 months ago
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆29Mar 18, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model☆346Oct 3, 2025Updated 6 months ago
- The code for SCG: Saliency and Contour Guided Salient Instance Segmentation☆10Feb 22, 2023Updated 3 years ago
- RareAct: A video dataset of unusual interactions☆34Aug 4, 2020Updated 5 years ago
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…☆25Jul 21, 2024Updated last year
- ☆17Aug 8, 2024Updated last year
- [ICLR 2024] ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation☆76Apr 25, 2024Updated last year
- The code for ICCV2021 Light Field Saliency Detection with Dual Local Graph Learning and Reciprocative Guidance☆12Apr 1, 2022Updated 4 years ago