[NeurIPS2023] Exploring Diverse In-Context Configurations for Image Captioning
☆44Nov 26, 2024Updated last year
Alternatives and similar repositories for ExploreCfg
Users that are interested in ExploreCfg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI2025] Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark☆27Jan 4, 2026Updated 2 months ago
- [AAAI2025] Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient☆44Apr 17, 2025Updated 11 months ago
- Open source implementation of the paper "MM-Vid: Advancing Video Understanding with GPT-4V(ision)".☆39Jan 4, 2026Updated 2 months ago
- [CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga☆146Jan 19, 2026Updated 2 months ago
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆18Oct 4, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Using distilled CLIP model to deploy the android device☆20Feb 28, 2023Updated 3 years ago
- [NeurIPS 2024] Visual Perception by Large Language Model’s Weights☆56Mar 31, 2025Updated 11 months ago
- Training Vision Transformers for Semi-Supervised Semantic Segmentation☆14Nov 3, 2025Updated 4 months ago
- [ICLR'26] Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?☆50Mar 9, 2026Updated 2 weeks ago
- 【NeurIPS 2024】The implementation of LIVE: Learnable In-Context Vector for Visual Question Answering https://arxiv.org/abs/2406.13185☆23May 31, 2025Updated 9 months ago
- ☆12Jan 21, 2025Updated last year
- [TPAMI2025] BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors☆15Apr 23, 2025Updated 11 months ago
- DualGNN: Dual Graph Neural Network for Micro-video Recommendation☆16Oct 27, 2021Updated 4 years ago
- Implementation of "Interleaved Latent Visual Reasoning with Selective Perceptual Modeling".☆45Jan 21, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A curated list of papers, datasets and resources pertaining to zero-shot object detection.☆29Mar 15, 2023Updated 3 years ago
- A LLM model for space understanding☆24Sep 12, 2025Updated 6 months ago
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆64Aug 6, 2025Updated 7 months ago
- Personalized Image Generation with Large Multimodal Models☆14May 13, 2025Updated 10 months ago
- A hybrid quantum-classical neural network simulation platform. Quantum simulation uses QTensor, a state-of-the-art tensor network-based s…☆14Jun 27, 2023Updated 2 years ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Mar 1, 2024Updated 2 years ago
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"☆16May 24, 2025Updated 10 months ago
- 东南大学 2021 级计算机专业操作系统课程实验 - Operating System Labwork source code in Dr.Kai Dong's Operating System Class. Based on OSTEP.☆13Jun 17, 2023Updated 2 years ago
- Let the IELTS-prompted GPTs accompany you through IELTS mock exams, help you with scoring and provide suggestions for improvement.☆40Apr 8, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…☆57Feb 4, 2026Updated last month
- ☆48Apr 5, 2020Updated 5 years ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆22Jul 26, 2025Updated 8 months ago
- ☆13Feb 25, 2025Updated last year
- ☆17Feb 23, 2025Updated last year
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)☆21Mar 10, 2026Updated 2 weeks ago
- Prompt Tuning on Graph-augmented Low-resource Text Classification. In TKDE 2024.☆15Jan 20, 2025Updated last year
- ☆13Sep 5, 2023Updated 2 years ago
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Aug 24, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆13May 13, 2025Updated 10 months ago
- ☆25Jan 12, 2026Updated 2 months ago
- ☆27Feb 2, 2024Updated 2 years ago
- Official Repository of Personalized Visual Instruct Tuning☆34Mar 6, 2025Updated last year
- ☆16Sep 12, 2023Updated 2 years ago
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning☆36Jun 10, 2025Updated 9 months ago
- 【ICCV 2023】Towards Instance-adaptive Inference for Federated Learning☆13Mar 31, 2025Updated 11 months ago