[NeurIPS2023] Exploring Diverse In-Context Configurations for Image Captioning
☆44Nov 26, 2024Updated last year
Alternatives and similar repositories for ExploreCfg
Users that are interested in ExploreCfg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI2025] Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark☆27Apr 4, 2026Updated 2 weeks ago
- [AAAI2025] Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient☆44Apr 17, 2025Updated last year
- Open source implementation of the paper "MM-Vid: Advancing Video Understanding with GPT-4V(ision)".☆40Jan 4, 2026Updated 3 months ago
- [CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga☆146Jan 19, 2026Updated 2 months ago
- [CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering☆21May 28, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆18Oct 4, 2024Updated last year
- Using distilled CLIP model to deploy the android device☆20Feb 28, 2023Updated 3 years ago
- An in-context learning research testbed☆19Mar 16, 2025Updated last year
- [NIPS 25'] Evaluation code of paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models"☆42Oct 19, 2025Updated 6 months ago
- [NeurIPS 2024] Visual Perception by Large Language Model’s Weights☆56Mar 31, 2025Updated last year
- Training Vision Transformers for Semi-Supervised Semantic Segmentation☆15Nov 3, 2025Updated 5 months ago
- ☆20Sep 19, 2023Updated 2 years ago
- [ICLR'26] Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?☆51Mar 9, 2026Updated last month
- 【NeurIPS 2024】The implementation of LIVE: Learnable In-Context Vector for Visual Question Answering https://arxiv.org/abs/2406.13185☆23May 31, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆17Mar 13, 2023Updated 3 years ago
- [TPAMI2025] BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors☆15Apr 23, 2025Updated 11 months ago
- Implementation of "Interleaved Latent Visual Reasoning with Selective Perceptual Modeling".☆48Apr 8, 2026Updated last week
- The minimal implementation of various popular AI models☆43Oct 21, 2025Updated 5 months ago
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆65Aug 6, 2025Updated 8 months ago
- Personalized Image Generation with Large Multimodal Models☆15May 13, 2025Updated 11 months ago
- DevKit for SoccerNet Team Action Spotting Challenge 2025☆18Aug 26, 2025Updated 7 months ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Mar 1, 2024Updated 2 years ago
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"☆16May 24, 2025Updated 10 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A LLM model for space understanding☆25Sep 12, 2025Updated 7 months ago
- ☆13May 30, 2021Updated 4 years ago
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…☆58Feb 4, 2026Updated 2 months ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆22Jul 26, 2025Updated 8 months ago
- ☆13Feb 25, 2025Updated last year
- ☆17Feb 23, 2025Updated last year
- EraseAnything, ICML 2025☆40Sep 28, 2025Updated 6 months ago
- The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning☆13Apr 14, 2024Updated 2 years ago
- ☆13Sep 5, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12Jul 4, 2024Updated last year
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Aug 24, 2023Updated 2 years ago
- Code for Paper "Explore More Guidance: A Task-aware Instruction Network for Sign Language Translation Enhanced with Data Augmentation"☆12Feb 6, 2023Updated 3 years ago
- ☆13May 13, 2025Updated 11 months ago
- The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering☆20May 10, 2022Updated 3 years ago
- ☆25Jan 12, 2026Updated 3 months ago
- ☆20Jan 15, 2024Updated 2 years ago