PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)
☆23Nov 29, 2022Updated 3 years ago
Alternatives and similar repositories for DANCE
Users that are interested in DANCE are comparing it to the libraries listed below
Sorting:
- Danmuku dataset☆11Jul 7, 2023Updated 2 years ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated 11 months ago
- Knowledge Distillation using Contrastive Language-Image Pretraining (CLIP) without a teacher model.☆18Sep 6, 2024Updated last year
- ☆21Oct 10, 2023Updated 2 years ago
- [ECCV'22 Poster] Explicit Image Caption Editing☆22Nov 30, 2022Updated 3 years ago
- [ACL2025 Findings] Official code for MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Spac…☆28Aug 30, 2025Updated 6 months ago
- Repository containing code for blockwise SSL training☆30Oct 13, 2024Updated last year
- An official PyTorch implementation for CLIPPR☆30Jul 22, 2023Updated 2 years ago
- ☆31Sep 7, 2023Updated 2 years ago
- ☆59Aug 30, 2023Updated 2 years ago
- ☆26Mar 20, 2023Updated 2 years ago
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆30Apr 27, 2024Updated last year
- 🦩 Official repository of paper "Visual Instruction Tuning with Polite Flamingo" (AAAI-24 Oral)☆65Dec 9, 2023Updated 2 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Oct 27, 2023Updated 2 years ago
- ☆32Feb 8, 2024Updated 2 years ago
- Code for paper 'Leveraging Predicate and Triplet Learning for Scene Graph Generation'. (CVPR 2024)☆32Sep 6, 2025Updated 5 months ago
- ☆32Mar 7, 2022Updated 3 years ago
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆77Jan 27, 2024Updated 2 years ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- ☆38Feb 8, 2024Updated 2 years ago
- ☆37Oct 7, 2023Updated 2 years ago
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated 11 months ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- Inverse Constitutional AI [ICLR 2025]: compressing pairwise preference data into a short constitution of principles.☆41Updated this week
- Computational predictor of protein intrinsic disorder and its functions☆10Dec 4, 2023Updated 2 years ago
- ☆38Jul 24, 2023Updated 2 years ago
- codebase for the SIMAT dataset and evaluation☆38Feb 16, 2022Updated 4 years ago
- m&ms: A Benchmark to Evaluate Tool-Use for multi-step multi-modal tasks☆45Sep 26, 2024Updated last year
- Official pytorch implementation of the IrwGAN for unaligned image-to-image translation☆34Dec 15, 2021Updated 4 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆85Nov 2, 2022Updated 3 years ago
- Multimodal-Procedural-Planning☆93Jun 1, 2023Updated 2 years ago
- ☆10Jun 14, 2023Updated 2 years ago
- ☆16Feb 22, 2025Updated last year
- Simulation of bubbles in liquid foams using matlab☆10Oct 1, 2020Updated 5 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆91Jul 27, 2022Updated 3 years ago
- ☆14Mar 20, 2025Updated 11 months ago
- ☆10Nov 17, 2022Updated 3 years ago
- ☆45Aug 14, 2023Updated 2 years ago