NilouAp / Top-Vision-Language-Papers
This repository collects and categorizes top vision-language papers based on their approaches and applications, with a special focus on the CLIP model.
☆13 Updated 5 months ago
Alternatives and similar repositories for Top-Vision-Language-Papers
Users who are interested in Top-Vision-Language-Papers are comparing it to the repositories listed below:
- Facial Attribute Recognition ☆11 Updated 9 months ago
- SAM: Sharpness-Aware Minimization (PyTorch) ☆12 Updated last year
- 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly. ☆10 Updated 9 months ago
- Python code for some ML algorithms related to CS 4860-5860 at the University of Colorado. ☆11 Updated 9 months ago
- Leetcode Practice in Python ☆11 Updated 9 months ago
- ☆10 Updated 5 months ago
- IJCB 2023: Towards Generalizable Morph Attack Detection via Consistency Regularization ☆13 Updated last year
- Face Morphing Attack Detection Benchmark (IJCB 2022: Robust Ensemble Morph Detection with Domain Generalization) ☆20 Updated 9 months ago
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes ☆11 Updated last year
- Exploring the fundamentals and advanced concepts of Large Language Models (LLMs) through practical implementations and collaborative lear… ☆22 Updated 9 months ago
- ☆14 Updated 2 years ago
- I make this repo to record the papers I read every day and organize them better. ☆20 Updated 9 months ago
- ☆12 Updated 2 years ago
- Public repository for the Colosseum Young Gladiators Workshop School of 2023 ☆11 Updated 2 years ago
- Robust Transformer with Locality Inductive Bias and Feature Normalization (JESTECH 2023) ☆11 Updated last year
- Identity-Preserving Face Frontalization with StyleGAN on CMU Multi-PIE ☆13 Updated 9 months ago
- [ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models ☆25 Updated last month
- Official repository of the paper: Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics ☆14 Updated last year
- This repository contains the official code for our paper: Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visu… ☆21 Updated 10 months ago
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language… ☆13 Updated 9 months ago
- [ICCV 2023] Towards Building More Robust Models with Frequency Bias ☆18 Updated last year
- [ACCV 2024] ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes 🚀🚀🚀 ☆37 Updated 8 months ago
- ☆24 Updated last year
- (BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" … ☆34 Updated 2 years ago
- The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind) ☆20 Updated 10 months ago
- [WACV 2025-Oral Presentation] Test-Time Adaptation in Point Clouds: Leveraging Sampling Variation with Weight Averaging ☆12 Updated 5 months ago
- [WACV 2025] FDS: Feedback-guided Domain Synthesis with Multi-Source Conditional Diffusion Models for Domain Generalization ☆18 Updated last year
- [PR 2024] TFS-ViT: Token-Level Feature Stylization for Domain Generalization ☆24 Updated 2 years ago
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models ☆20 Updated 5 months ago
- Official implementation of the paper "FLIP: Cross-domain Face Anti-spoofing with Language Guidance". (ICCV 2023) ☆81 Updated last year