NilouAp / Top-Vision-Language-Papers
This repository collects and categorizes top vision-language papers based on their approaches and applications, with a special focus on the CLIP model.
☆13Updated 2 weeks ago
Alternatives and similar repositories for Top-Vision-Language-Papers:
Users who are interested in Top-Vision-Language-Papers are comparing it to the repositories listed below
- Facial Attribute Recognition☆11Updated 4 months ago
- SAM: Sharpness-Aware Minimization (PyTorch)☆12Updated last year
- Leetcode Practice in Python☆11Updated 4 months ago
- Python code for some ML algorithms related to CS 4860-5860 at the University of Colorado.☆11Updated 4 months ago
- ☆10Updated last week
- 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.☆10Updated 4 months ago
- The C++ Complete Course is a comprehensive guide for individuals to learn C++ programming language from zero to advanced level. It includ…☆9Updated 10 months ago
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes☆11Updated last year
- A comprehensive template for aligning large language models (LLMs) using Reinforcement Learning from Human Feedback (RLHF), transfer lear…☆15Updated 4 months ago
- FlameFinder: Illuminating Obscured Fire through Smoke with DML-guided Detection☆12Updated last year
- ☆14Updated 2 years ago
- ☆12Updated 2 years ago
- IJCB 2023: Towards Generalizable Morph Attack Detection via Consistency Regularization☆13Updated 11 months ago
- Exploring the fundamentals and advanced concepts of Large Language Models (LLMs) through practical implementations and collaborative lear…☆22Updated 4 months ago
- Face Morphing Attack Detection Benchmark☆20Updated 4 months ago
- Public repository for the Colosseum Young Gladiators Workshop School of 2023☆11Updated last year
- I made this repo to record the papers I read every day and organize them better.☆20Updated 4 months ago
- Identity-Preserving Face Frontalization with StyleGAN on CMU Multi-PIE☆10Updated 4 months ago
- Robust Transformer with Locality Inductive Bias and Feature Normalization (JESTECH 2023)☆11Updated 9 months ago
- [WACV 2025] FDS: Feedback-guided Domain Synthesis with Multi-Source Conditional Diffusion Models for Domain Generalization☆16Updated 9 months ago
- MorDIFF: Recognition Vulnerability and Attack Detectability of Face Morphing Attacks Created by Diffusion Autoencoders☆18Updated 2 years ago
- [PR 2024] TFS-ViT: Token-Level Feature Stylization for Domain Generalization☆24Updated 2 years ago
- [NeurIPS 2024] WATT: Weight Average Test-Time Adaptation of CLIP☆49Updated 6 months ago
- Official repository of the paper: Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics☆12Updated last year
- Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models☆20Updated 2 months ago
- ☆20Updated 3 months ago
- ☆15Updated last year
- The official repository of ECCV 2024 paper "Outlier-Aware Test-time Adaptation with Stable Memory Replay"☆18Updated 7 months ago
- Official code for ICML 2024 paper, "Connecting the Dots: Collaborative Fine-tuning for Black-Box Vision-Language Models"☆17Updated 10 months ago
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models☆16Updated last week