Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)
☆35Oct 16, 2024Updated last year
Alternatives and similar repositories for C3
Users that are interested in C3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025] Video Action Differencing☆53Jul 3, 2025Updated 11 months ago
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆40May 26, 2025Updated last year
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆34Jun 8, 2023Updated 3 years ago
- [CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"…☆38Nov 25, 2025Updated 6 months ago
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)☆98Oct 19, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)☆134Nov 5, 2025Updated 7 months ago
- [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision☆72Jul 10, 2024Updated last year
- A Vision-Language Benchmark for Microscopy Understanding☆31Mar 13, 2025Updated last year
- ☆59Aug 30, 2023Updated 2 years ago
- ☆19Oct 28, 2025Updated 7 months ago
- [EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning☆14May 13, 2025Updated last year
- ☆36Feb 5, 2024Updated 2 years ago
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning☆144Mar 16, 2023Updated 3 years ago
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning☆56Aug 16, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs☆23Oct 15, 2024Updated last year
- ☆11Nov 5, 2025Updated 7 months ago
- ☆12Dec 6, 2024Updated last year
- ☆12Apr 1, 2025Updated last year
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆13Jun 7, 2025Updated last year
- Code for paper: Reinforced Vision Perception with Tools☆72Oct 3, 2025Updated 8 months ago
- ☆34Apr 11, 2025Updated last year
- ☆19Mar 23, 2025Updated last year
- ☆14Oct 7, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆26Jun 8, 2026Updated last week
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25May 14, 2026Updated last month
- Applies ROME and MEMIT on Mamba-S4 models☆15Apr 5, 2024Updated 2 years ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- [CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature☆107Mar 22, 2025Updated last year
- ☆11Feb 16, 2023Updated 3 years ago
- Source materials for CoinFT☆36Jan 23, 2026Updated 4 months ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Jun 16, 2025Updated last year
- Python codes for mathematical modeling.☆13Sep 5, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Jul 25, 2023Updated 2 years ago
- The example of correspondence between fine classes and superclasses (coarse classes) in ImageNet.☆13Dec 4, 2024Updated last year
- ☆12Aug 8, 2024Updated last year
- Inverse Constitutional AI [ICLR 2025]: compressing pairwise preference data into a short constitution of principles.☆41May 6, 2026Updated last month
- [ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models☆111Dec 3, 2024Updated last year
- Code and instructions accompanying ICCV'23 paper Protoype-based Dataset Comparison☆18Dec 15, 2023Updated 2 years ago
- ☆13May 21, 2024Updated 2 years ago