☆17Oct 1, 2024Updated last year
Alternatives and similar repositories for Vision-Language-Model-in-ECCV-2024
Users that are interested in Vision-Language-Model-in-ECCV-2024 are comparing it to the libraries listed below.
- ☆16Sep 29, 2024Updated last year
- The official source code for "Subgraph Federated Learning for Local Generalization (FedLoG)" at ICLR 2025 (Oral).☆16May 6, 2025Updated last year
- Advances in recent large vision language models (LVLMs)☆15Sep 23, 2024Updated last year
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"☆15Oct 12, 2023Updated 2 years ago
- KTH Deep Learning advanced (DD2412) project. Task: Reproducing FixMatch and investigating Noisy (Pseudo) Labels and confirmation Erro…☆10Jul 15, 2021Updated 4 years ago
- Repository for the paper: Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models☆28Nov 29, 2023Updated 2 years ago
- Code and instructions accompanying ICCV'23 paper Prototype-based Dataset Comparison☆18Dec 15, 2023Updated 2 years ago
- ☆10Jan 29, 2019Updated 7 years ago
- Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)☆19Dec 15, 2023Updated 2 years ago
- [TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Nov 22, 2024Updated last year
- Project Page (PromptStyler, ICCV 2023)☆13Aug 16, 2023Updated 2 years ago
- This is an official implementation in PyTorch of PTH-Net: Dynamic Facial Expression Recognition without Face Detection and Alignment.☆14Jul 1, 2025Updated 10 months ago
- Domain Generalization through Distilling CLIP with Language Guidance☆35Oct 18, 2023Updated 2 years ago
- [IJCV2025] https://arxiv.org/abs/2304.04521☆15Jan 22, 2025Updated last year
- A collection of deep semi-supervised learning resources☆13Jul 1, 2022Updated 3 years ago
- Guided Interpretable Facial Expression Recognition via Spatial Action Unit Cues☆19Sep 16, 2024Updated last year
- Official PyTorch implementation of Adaptive Self-Training Framework for Fine-grained Scene Graph generation (ST-SGG), accept…☆22Jan 30, 2024Updated 2 years ago
- ☆23Apr 29, 2025Updated last year
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆13Jun 7, 2025Updated 11 months ago
- Visual and Embodied Concepts evaluation benchmark☆21Oct 10, 2023Updated 2 years ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆39Feb 13, 2025Updated last year
- Flow4D: Leveraging 4D Voxel Network for LiDAR Scene Flow Estimation☆27Mar 13, 2025Updated last year
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution☆59Mar 4, 2025Updated last year
- The official source code for Self-Guided Robust Graph Structure Refinement (SG-GSR) at WWW 2024 Research Track.☆17Apr 23, 2024Updated 2 years ago
- AI-Generated Video Detection via Perceptual Straightening (NeurIPS2025)☆42Apr 23, 2026Updated last month
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆22Mar 26, 2025Updated last year
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…☆25Jul 21, 2024Updated last year
- Official implementation for paper "CEPrompt: Cross-Modal Emotion-Aware Prompting for Facial Expression Recognition" (accepted to IEEE TC…☆17Oct 20, 2025Updated 7 months ago
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- PyTorch Implementation of Meta Attack via Contrastive Surrogate Objective☆12May 21, 2024Updated 2 years ago
- [ICCV’23] Official repository for "TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation"☆22Nov 1, 2023Updated 2 years ago
- ☆12Nov 1, 2024Updated last year
- Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts"☆104Mar 6, 2024Updated 2 years ago
- ☆21Oct 30, 2024Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆99Mar 26, 2025Updated last year
- Embedding-based evaluation metrics for dialogue generation.☆15Jan 8, 2023Updated 3 years ago
- [EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking☆12Aug 22, 2025Updated 9 months ago
- [BMVC 2022] This is the official code of our Paper "Revisiting Self-Supervised Contrastive Learning for Facial Expression Recognition"☆23Jul 8, 2024Updated last year
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"☆33Jul 8, 2025Updated 10 months ago