yongliang-wu/ExploreCfg

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yongliang-wu/ExploreCfg)

yongliang-wu / ExploreCfg

[NeurIPS2023] Exploring Diverse In-Context Configurations for Image Captioning

☆47

Alternatives and similar repositories for ExploreCfg

Users that are interested in ExploreCfg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yongliang-wu / Repurpose
View on GitHub
[AAAI2025] Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark
☆30Apr 4, 2026Updated 3 months ago
yongliang-wu / MM-VID
View on GitHub
Open source implementation of the paper "MM-Vid: Advancing Video Understanding with GPT-4V(ision)".
☆44Jan 4, 2026Updated 6 months ago
yongliang-wu / NumPro
View on GitHub
[CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga
☆150Jan 19, 2026Updated 5 months ago
ForJadeForest / Lever-LM
View on GitHub
The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models
☆18Oct 4, 2024Updated last year
ForJadeForest / ImageSearchLightningCLIP
View on GitHub
Using distilled CLIP model to deploy the android device
☆20Feb 28, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Kamichanw / ICLTestbed
View on GitHub
An in-context learning research testbed
☆19Mar 16, 2025Updated last year
mercurystraw / Kris_Bench
View on GitHub
[NIPS 25'] Evaluation code of paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models"
☆45Oct 19, 2025Updated 8 months ago
FeipengMa6 / VLoRA
View on GitHub
[NeurIPS 2024] Visual Perception by Large Language Model’s Weights
☆56Mar 31, 2025Updated last year
OpenEnvision / AutoRubric-as-Reward
View on GitHub
Auto-Rubric as Reward: From Implicit Preference to Explicit Generative Criteria
☆47Jul 2, 2026Updated last week
JoyHuYY1412 / S4Former
View on GitHub
Training Vision Transformers for Semi-Supervised Semantic Segmentation
☆16Nov 3, 2025Updated 8 months ago
wenyu1009 / RTSRN
View on GitHub
☆20Sep 19, 2023Updated 2 years ago
UMR-R / QMem
View on GitHub
☆45May 16, 2026Updated last month
ForJadeForest / LIVE-Learnable-In-Context-Vector
View on GitHub
【NeurIPS 2024】The implementation of LIVE: Learnable In-Context Vector for Visual Question Answering https://arxiv.org/abs/2406.13185
☆23May 31, 2025Updated last year
SueMarsR / Emiece
View on GitHub
An End-to-end Mutually Interactive Emotion-Cause Pair Extractor via Soft-sharing
☆13Aug 11, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
iLearn-Lab / TMM23-DualGNN
View on GitHub
DualGNN: Dual Graph Neural Network for Micro-video Recommendation
☆17Apr 8, 2026Updated 3 months ago
CGCL-codes / Gen-AF
View on GitHub
The implementation of our IEEE S&P 2024 paper "Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples".
☆11Jun 28, 2024Updated 2 years ago
xxiqiao / TROJail
View on GitHub
Official implementation of "TROJail: Trajectory-Level Optimization for Multi-Turn Large Language Model Jailbreaks with Process Rewards"
☆30Updated this week
itayle / diverse-demonstrations
View on GitHub
Diverse Demonstrations Improve In-context Compositional Generalization
☆12Jul 7, 2023Updated 3 years ago
mshukor / EvALign-ICL
View on GitHub
[ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …
☆22Mar 1, 2024Updated 2 years ago
injadlu / DAMA
View on GitHub
[ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"
☆16May 24, 2025Updated last year
w1oves / hqclip
View on GitHub
[ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets
☆67Aug 6, 2025Updated 11 months ago
JinBridger / SEU-Operating-System-Labwork
View on GitHub
东南大学 2021 级计算机专业操作系统课程实验 - Operating System Labwork source code in Dr.Kai Dong's Operating System Class. Based on OSTEP.
☆14Jun 17, 2023Updated 3 years ago
Kamichanw / SeekDeeper
View on GitHub
The minimal implementation of various popular AI models
☆48Apr 29, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
FoundationAgents / VR-Bench
View on GitHub
We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…
☆65Feb 4, 2026Updated 5 months ago
Jyxarthur / shot-by-shot
View on GitHub
[ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…
☆23May 16, 2026Updated last month
JoyHuYY1412 / LST_LVIS
View on GitHub
☆48Apr 5, 2020Updated 6 years ago
HAWLYQ / InfoMetIC
View on GitHub
☆13Sep 5, 2023Updated 2 years ago
RongKaiWeskerMA / INSTA
View on GitHub
The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning
☆13Apr 14, 2024Updated 2 years ago
tomguluson92 / EraseAnything
View on GitHub
EraseAnything, ICML 2025
☆42Sep 28, 2025Updated 9 months ago
HITsz-TMG / ICL-State-Vector
View on GitHub
☆12Jul 4, 2024Updated 2 years ago
sterzhang / PVIT
View on GitHub
Official Repository of Personalized Visual Instruct Tuning
☆34Mar 6, 2025Updated last year
LehongWu / MacDiff
View on GitHub
The official PyTorch implementation of "MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion" in ECCV 2024.
☆19Jul 6, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
mcahny / rovit
View on GitHub
RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"
☆17Aug 24, 2023Updated 2 years ago
yongcaoplus / TIN-SLT
View on GitHub
Code for Paper "Explore More Guidance: A Task-aware Instruction Network for Sign Language Translation Enhanced with Data Augmentation"
☆12Feb 6, 2023Updated 3 years ago
CCIIPLab / DPT
View on GitHub
The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering
☆20May 10, 2022Updated 4 years ago
HauffQian / DGAP
View on GitHub
☆14May 13, 2025Updated last year
whdii / TMM
View on GitHub
☆21Jan 15, 2024Updated 2 years ago
apple / ml-rl-dllm
View on GitHub
Repository companioning the paper "Learning Unmasking Policies for Diffusion Language Models"
☆17Mar 30, 2026Updated 3 months ago
hongyurain / Recommendation-with-modality-information
View on GitHub
☆27Feb 2, 2024Updated 2 years ago