[NeurIPS2023] Exploring Diverse In-Context Configurations for Image Captioning
☆47Nov 26, 2024Updated last year
Alternatives and similar repositories for ExploreCfg
Users that are interested in ExploreCfg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI2025] Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark☆30Apr 4, 2026Updated last month
- [CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga☆149Jan 19, 2026Updated 4 months ago
- [CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering☆21May 28, 2025Updated last year
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆18Oct 4, 2024Updated last year
- Using distilled CLIP model to deploy the android device☆20Feb 28, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- An in-context learning research testbed☆19Mar 16, 2025Updated last year
- [NeurIPS 2024] Visual Perception by Large Language Model’s Weights☆56Mar 31, 2025Updated last year
- Training Vision Transformers for Semi-Supervised Semantic Segmentation☆15Nov 3, 2025Updated 6 months ago
- ☆20Sep 19, 2023Updated 2 years ago
- [ICLR'26] Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?☆53Mar 9, 2026Updated 2 months ago
- [TPAMI2025] BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors☆16Apr 23, 2025Updated last year
- DualGNN: Dual Graph Neural Network for Micro-video Recommendation☆17Apr 8, 2026Updated last month
- Implementation of "Interleaved Latent Visual Reasoning with Selective Perceptual Modeling".☆53Apr 8, 2026Updated last month
- Diverse Demonstrations Improve In-context Compositional Generalization☆12Jul 7, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"☆16May 24, 2025Updated last year
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Mar 1, 2024Updated 2 years ago
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆66Aug 6, 2025Updated 9 months ago
- DevKit for SoccerNet Team Action Spotting Challenge 2025☆19Aug 26, 2025Updated 9 months ago
- Personalized Image Generation with Large Multimodal Models☆17May 13, 2025Updated last year
- Pytorch implementation for "Bootstrap Latent Representations for Multi-modal Recommendation"-WWW'23☆68Aug 3, 2023Updated 2 years ago
- Let the IELTS-prompted GPTs accompany you through IELTS mock exams, help you with scoring and provide suggestions for improvement.☆40Apr 8, 2024Updated 2 years ago
- [ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.☆567Jan 4, 2026Updated 4 months ago
- ☆48Apr 5, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆17Feb 23, 2025Updated last year
- EraseAnything, ICML 2025☆41Sep 28, 2025Updated 8 months ago
- The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning☆13Apr 14, 2024Updated 2 years ago
- ☆13Sep 5, 2023Updated 2 years ago
- ☆12Jul 4, 2024Updated last year
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Aug 24, 2023Updated 2 years ago
- some object detection algo☆14Jul 25, 2024Updated last year
- Code for Paper "Explore More Guidance: A Task-aware Instruction Network for Sign Language Translation Enhanced with Data Augmentation"☆12Feb 6, 2023Updated 3 years ago
- ☆13May 13, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆27Jan 12, 2026Updated 4 months ago
- Official Repository of Personalized Visual Instruct Tuning☆34Mar 6, 2025Updated last year
- 生成对抗网络综述☆10May 22, 2022Updated 4 years ago
- ☆16Sep 12, 2023Updated 2 years ago
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning☆36Jun 10, 2025Updated 11 months ago
- Code for the LOTA: Bit-Planes Guided AI-Generated Image Detection☆48May 12, 2026Updated 2 weeks ago
- [ICCV 2023] Latent Action Composition for Skeleton-based Action Segmentation☆22Oct 25, 2023Updated 2 years ago