NilouAp / Top-Vision-Language-PapersLinks
This repository collects and categorizes top vision-language papers based on their approaches and applications, with a special focus on the CLIP model.
☆13Updated 3 months ago
Alternatives and similar repositories for Top-Vision-Language-Papers
Users that are interested in Top-Vision-Language-Papers are comparing it to the libraries listed below
Sorting:
- Facial Attribute Recognition☆11Updated 7 months ago
- ☆10Updated 3 months ago
- 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.☆10Updated 7 months ago
- Leetcode Practice in Python☆11Updated 7 months ago
- Python codes for some ML algorithms related to CS 4860-5860 In University of Colorado.☆11Updated 7 months ago
- SAM: Sharpness-Aware Minimization (PyTorch)☆12Updated last year
- Exploring the fundamentals and advanced concepts of Large Language Models (LLMs) through practical implementations and collaborative lear…☆22Updated 6 months ago
- IJCB 2023: Towards Generalizable Morph Attack Detection via Consistency Regularization☆13Updated last year
- ☆14Updated 2 years ago
- Face Morphing Attack Detection Benchmark (IJCB 2022: Robust Ensemble Morph Detection with Domain Generalization)☆20Updated 7 months ago
- Public repository for the Colosseum Young Gladiators Workshop School of 2023☆11Updated 2 years ago
- I make this repo to record the papers I read every day and organize them better.☆20Updated 7 months ago
- Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models☆23Updated 5 months ago
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models☆19Updated 3 months ago
- Identity-Preserving Face Frontalization with StyleGAN on CMU Multi-PIE☆11Updated 7 months ago
- Official code for paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models, ICML2024"☆24Updated 5 months ago
- Advances in recent large vision language models (LVLMs)☆14Updated 9 months ago
- Cross Modal Focal Loss for RGBD Face Anti-Spoofing☆12Updated 4 years ago
- ☆10Updated 4 months ago
- Robust Transformer with Locality Inductive Bias and Feature Normalization (JESTECH 2023)☆11Updated last year
- Official repository of the paper: Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics☆14Updated last year
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆39Updated 4 months ago
- ☆14Updated 2 years ago
- ☆15Updated 2 months ago
- The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind)☆20Updated 8 months ago
- [ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".☆12Updated 9 months ago
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language…☆12Updated 7 months ago
- This is the official repository for the code and datasets in the paper "Progressive Open Space Expansion for Open-Set Model Attribution",…☆24Updated last year
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆17Updated 3 weeks ago
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆12Updated 3 months ago