Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)
☆34 Oct 16, 2024 Updated last year
Alternatives and similar repositories for C3
Users that are interested in C3 are comparing it to the libraries listed below.
- [NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection ☆21 Feb 3, 2024 Updated 2 years ago
- [ICLR 2025] Video Action Differencing ☆53 Jul 3, 2025 Updated 10 months ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023) ☆34 Jun 8, 2023 Updated 2 years ago
- [CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"… ☆37 Nov 25, 2025 Updated 5 months ago
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024) ☆99 Oct 19, 2024 Updated last year
- [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision ☆72 Jul 10, 2024 Updated last year
- ☆59 Aug 30, 2023 Updated 2 years ago
- ☆18 Oct 28, 2025 Updated 6 months ago
- [EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning ☆14 May 13, 2025 Updated 11 months ago
- ☆41 Sep 9, 2025 Updated 7 months ago
- SotA text-only image/video method (IJCAI 2023) ☆15 Jan 9, 2024 Updated 2 years ago
- ☆23 Dec 23, 2025 Updated 4 months ago
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning ☆143 Mar 16, 2023 Updated 3 years ago
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs ☆23 Oct 15, 2024 Updated last year
- ☆11 Nov 5, 2025 Updated 6 months ago
- ☆12 Apr 1, 2025 Updated last year
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning ☆13 Jun 7, 2025 Updated 11 months ago
- Code for paper: Reinforced Vision Perception with Tools ☆72 Oct 3, 2025 Updated 7 months ago
- ☆34 Apr 11, 2025 Updated last year
- ☆19 Mar 23, 2025 Updated last year
- ☆13 Aug 14, 2022 Updated 3 years ago
- ☆14 Oct 7, 2024 Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ☆25 Nov 23, 2024 Updated last year
- Predicting Gene-Disease Associations ☆15 Mar 9, 2023 Updated 3 years ago
- Applies ROME and MEMIT on Mamba-S4 models ☆15 Apr 5, 2024 Updated 2 years ago
- [CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature ☆100 Mar 22, 2025 Updated last year
- A tool for calling (and calling out to) large language models. ☆16 Aug 13, 2024 Updated last year
- ☆11 Feb 16, 2023 Updated 3 years ago
- Source materials for CoinFT ☆33 Jan 23, 2026 Updated 3 months ago
- GitHub repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models ☆24 Jun 16, 2025 Updated 10 months ago
- Benchmarking and Analyzing Generative Data for Visual Recognition ☆26 Jul 25, 2023 Updated 2 years ago
- The example of correspondence between fine classes and superclasses (coarse classes) in ImageNet. ☆13 Dec 4, 2024 Updated last year
- ☆12 Aug 8, 2024 Updated last year
- Support finetuning GLM4v with zero2 ☆16 Jun 29, 2024 Updated last year
- Inverse Constitutional AI [ICLR 2025]: compressing pairwise preference data into a short constitution of principles. ☆41 Mar 2, 2026 Updated 2 months ago
- Modality Gap–Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models ☆59 Apr 1, 2026 Updated last month
- [ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models ☆111 Dec 3, 2024 Updated last year
- Code and instructions accompanying ICCV'23 paper Prototype-based Dataset Comparison ☆18 Dec 15, 2023 Updated 2 years ago
- ☆13 May 21, 2024 Updated last year