☆17Oct 1, 2024Updated last year
Alternatives and similar repositories for Vision-Language-Model-in-ECCV-2024
Users that are interested in Vision-Language-Model-in-ECCV-2024 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Sep 29, 2024Updated last year
- The official source code for "Subgraph Federated Learning for Local Generalization (FedLoG)" at ICLR 2025 (Oral).☆15May 6, 2025Updated 10 months ago
- ☆20Mar 12, 2025Updated last year
- Advances in recent large vision language models (LVLMs)☆15Sep 23, 2024Updated last year
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"☆15Oct 12, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- KTH Deep Learning advanced (DD2412) project. Task: Reproducing FixMatch and investigating on Noisy (Pseudo) Labels and confirmation Erro…☆10Jul 15, 2021Updated 4 years ago
- CLIPCleaner: Cleaning Noisy Labels with CLIP (ACM MM2024)☆15Apr 28, 2025Updated 10 months ago
- Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models☆27Nov 29, 2023Updated 2 years ago
- Code and instructions accompanying ICCV'23 paper Protoype-based Dataset Comparison☆18Dec 15, 2023Updated 2 years ago
- [WACV 2026] An extremely simple method for validation-free efficient adaptation of CLIP-like VLMs that is robust to the learning rate.☆32Apr 17, 2025Updated 11 months ago
- Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)☆19Dec 15, 2023Updated 2 years ago
- [TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Nov 22, 2024Updated last year
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- Project Page (PromptStyler, ICCV 2023)☆13Aug 16, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is an official implementation in PyTorch of PTH-Net: Dynamic Facial Expression Recognition without Face Detection and Alignment..☆13Jul 1, 2025Updated 8 months ago
- [IJCV2025] https://arxiv.org/abs/2304.04521☆15Jan 22, 2025Updated last year
- [ICLR 2023] Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning☆15Aug 2, 2023Updated 2 years ago
- Guided Interpretable Facial Expression Recognition via Spatial Action Unit Cues☆17Sep 16, 2024Updated last year
- ☆24Apr 29, 2025Updated 10 months ago
- Visual and Embodied Concepts evaluation benchmark☆21Oct 10, 2023Updated 2 years ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆39Feb 13, 2025Updated last year
- Flow4D: Leveraging 4D Voxel Network for LiDAR Scene Flow Estimation☆27Mar 13, 2025Updated last year
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution☆59Mar 4, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This is anonymous repository for submitting our work to a conference☆14Dec 17, 2024Updated last year
- The official source code for Self-Guided Robust Graph Structure Refinement (SG-GSR) at WWW 2024 Research Track.☆17Apr 23, 2024Updated last year
- [CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant☆27Dec 2, 2025Updated 3 months ago
- Official implementation for paper "CEPrompt: Cross-Modal Emotion-Aware Prompting for Facial Expression Recognition" (accepted to IEEE TC…☆17Oct 20, 2025Updated 5 months ago
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…☆25Jul 21, 2024Updated last year
- Official implementation of IJCAI 2024 paper "Cross-Domain Feature Augmentation for Domain Generalization"☆18Feb 21, 2026Updated last month
- ICLR 2025 paper X-NeMo & Project X-Portrati2☆123Aug 7, 2025Updated 7 months ago
- A Python tool for preprocessing the AffectNet dataset into a structure that can be directly read by Pytorch's ImageFolder method.一个用于预处理A…☆17Mar 12, 2024Updated 2 years ago
- Official repository of Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization [EMNLP'22 …☆10May 20, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICCV’23] Official repository for "TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation"☆22Nov 1, 2023Updated 2 years ago
- The official source code for "Vision Language Model is NOT All You Need: Augmentation Strategies for Molecule Language Model".☆14Jul 23, 2024Updated last year
- Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts"☆104Mar 6, 2024Updated 2 years ago
- ☆20Oct 30, 2024Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆98Mar 26, 2025Updated last year
- Dataset accompanying the paper "Adaptive Methods for Real-World Domain Generalization"☆16Aug 17, 2023Updated 2 years ago
- The official source code for "Single-cell RNA-seq data imputation using Feature Propagation", accepted at 2023 ICML Workshop on Computati…☆12Aug 31, 2023Updated 2 years ago