NilouAp / Top-Vision-Language-PapersLinks

This repository collects and categorizes top vision-language papers based on their approaches and applications, with a special focus on the CLIP model.

☆13

Alternatives and similar repositories for Top-Vision-Language-Papers

Users that are interested in Top-Vision-Language-Papers are comparing it to the libraries listed below

Sorting:

NilouAp / Facial-Attribute-Recognition-Multi-Task
Facial Attribute Recognition
☆11Updated 7 months ago
FLotfiGit / FLotfiGit2.github.io
☆10Updated 3 months ago
FLotfiGit / best-of-ml-python
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
☆10Updated 7 months ago
FLotfiGit / LeetcodePython
Leetcode Practice in Python
☆11Updated 7 months ago
FLotfiGit / Machine-Learning-CS5860
Python codes for some ML algorithms related to CS 4860-5860 In University of Colorado.
☆11Updated 7 months ago
FLotfiGit / sam
SAM: Sharpness-Aware Minimization (PyTorch)
☆12Updated last year
HRajoliN / Think-Out-Loud-Exploring-LLMs-
Exploring the fundamentals and advanced concepts of Large Language Models (LLMs) through practical implementations and collaborative lear…
☆22Updated 6 months ago
kashiani / SelfMorphing_GRL
IJCB 2023: Towards Generalizable Morph Attack Detection via Consistency Regularization
☆13Updated last year
HRajoliN / MalleConv_pytorch
☆14Updated 2 years ago
kashiani / Face-Morphing-Attack-Detection-Benchmark
Face Morphing Attack Detection Benchmark (IJCB 2022: Robust Ensemble Morph Detection with Domain Generalization)
☆20Updated 7 months ago
FLotfiGit / colosseum-school-2023
Public repository for the Colosseum Young Gladiators Workshop School of 2023
☆11Updated 2 years ago
kashiani / Paper-Reading-Record-in-Computer-Vision
I make this repo to record the papers I read every day and organize them better.
☆20Updated 7 months ago
HashmatShadab / Robust-LLaVA
Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models
☆23Updated 5 months ago
zhangce01 / DeGF
[ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models
☆19Updated 3 months ago
NilouAp / StyleGAN_for_Face_Frontalization
Identity-Preserving Face Frontalization with StyleGAN on CMU Multi-PIE
☆11Updated 7 months ago
zhiheLu / Ensemble_VLM
Official code for paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models, ICML2024"
☆24Updated 5 months ago
NKU-MetautoAI / awesome-large-vision-language-models
Advances in recent large vision language models (LVLMs)
☆14Updated 9 months ago
anjith2006 / bob.paper.cross_modal_focal_loss_cvpr2021
Cross Modal Focal Loss for RGBD Face Anti-Spoofing
☆12Updated 4 years ago
MSiam / PixFoundation
☆10Updated 4 months ago
Omid-Nejati / Locality-iN-Locality
Robust Transformer with Locality Inductive Bias and Feature Normalization (JESTECH 2023)
☆11Updated last year
shanface33 / GPT4MF_UB
Official repository of the paper: Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics
☆14Updated last year
Ruiyang-061X / VL-Uncertainty
🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".
☆39Updated 4 months ago
Sara-Ahmed / GMML
☆14Updated 2 years ago
CSU-JPG / Awesome-VLM-Reasoning
☆15Updated 2 months ago
deep-real / DEAL
The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind)
☆20Updated 8 months ago
fahadshamshad / deep-facial-privacy-prior
[ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".
☆12Updated 9 months ago
sangminwoo / RITUAL
Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language…
☆12Updated 7 months ago
ICTMCG / POSE
This is the official repository for the code and datasets in the paper "Progressive Open Space Expansion for Open-Set Model Attribution",…
☆24Updated last year
LunarShen / DsicoVLA
[CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval
☆17Updated 3 weeks ago
Cogito2012 / OpenMixer
[WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
☆12Updated 3 months ago