NilouAp / Top-Vision-Language-PapersLinks
This repository collects and categorizes top vision-language papers based on their approaches and applications, with a special focus on the CLIP model.
β13Updated last month
Alternatives and similar repositories for Top-Vision-Language-Papers
Users that are interested in Top-Vision-Language-Papers are comparing it to the libraries listed below
Sorting:
- Facial Attribute Recognitionβ11Updated 6 months ago
- π A ranked list of awesome machine learning Python libraries. Updated weekly.β10Updated 5 months ago
- SAM: Sharpness-Aware Minimization (PyTorch)β12Updated last year
- β10Updated last month
- Python codes for some ML algorithms related to CS 4860-5860 In University of Colorado.β11Updated 5 months ago
- Leetcode Practice in Pythonβ11Updated 5 months ago
- The C++ Complete Course is a comprehensive guide for individuals to learn C++ programming language from zero to advanced level. It includβ¦β9Updated 11 months ago
- A comprehensive template for aligning large language models (LLMs) using Reinforcement Learning from Human Feedback (RLHF), transfer learβ¦β15Updated 5 months ago
- β14Updated 2 years ago
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakesβ11Updated last year
- FlameFinder: Illuminating Obscured Fire through Smoke with DML-guided Detectionβ12Updated last year
- Exploring the fundamentals and advanced concepts of Large Language Models (LLMs) through practical implementations and collaborative learβ¦β22Updated 5 months ago
- Face Morphing Attack Detection Benchmark (IJCB 2022: Robust Ensemble Morph Detection with Domain Generalization)β20Updated 5 months ago
- IJCB 2023: Towards Generalizable Morph Attack Detection via Consistency Regularizationβ13Updated last year
- β12Updated 2 years ago
- Public repository for the Colosseum Young Gladiators Workshop School of 2023β11Updated 2 years ago
- Identity-Preserving Face Frontalization with StyleGAN on CMU Multi-PIEβ11Updated 5 months ago
- I make this repo to record the papers I read every day and organize them better.β20Updated 5 months ago
- UAV communication using apprenticeship learning via Inverse Reinforcement Learning (IRL)β18Updated 6 months ago
- Robust Transformer with Locality Inductive Bias and Feature Normalization (JESTECH 2023)β11Updated 10 months ago
- [WACV 2025] FDS: Feedback-guided Domain Synthesis with Multi-Source Conditional Diffusion Models for Domain Generalizationβ16Updated 10 months ago
- MorDIFF: Recognition Vulnerability and Attack Detectability of Face Morphing Attacks Created by Diffusion Autoencodersβ18Updated 2 years ago
- [CVPR 2025] Spectral State Space Model for Rotation-Invariant Visual Representation Learningβ12Updated 2 weeks ago
- [PR 2024] TFS-ViT: Token-Level Feature Stylization for Domain Generalizationβ24Updated 2 years ago
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Languageβ¦β11Updated 5 months ago
- β15Updated last year
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Modelsβ18Updated last month
- Official repository of the paper: Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensicsβ13Updated last year
- Official implementation of the paper "FLIP: Cross-domain Face Anti-spoofing with Language Guidance". (ICCV 2023)β72Updated last year
- [CVPR '25] Official implementation of the paper "Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages", accepted at (anβ¦β14Updated 2 months ago