This repository collects and categorizes top vision-language papers based on their approaches and applications, with a special focus on the CLIP model.
β14Apr 11, 2025Updated last year
Alternatives and similar repositories for Top-Vision-Language-Papers
Users that are interested in Top-Vision-Language-Papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Facial Attribute Recognitionβ11Dec 6, 2024Updated last year
- π A ranked list of awesome machine learning Python libraries. Updated weekly.β10Dec 12, 2024Updated last year
- SAM: Sharpness-Aware Minimization (PyTorch)β12Feb 21, 2024Updated 2 years ago
- Leetcode Practice in Pythonβ12Dec 12, 2024Updated last year
- Python codes for some ML algorithms related to CS 4860-5860 In University of Colorado.β11Dec 12, 2024Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- β10Apr 15, 2025Updated last year
- Face Morphing Attack Detection Benchmark (IJCB 2022: Robust Ensemble Morph Detection with Domain Generalization)β20Dec 18, 2024Updated last year
- I make this repo to record the papers I read every day and organize them better.β20Dec 15, 2024Updated last year
- Identity-Preserving Face Frontalization with StyleGAN on CMU Multi-PIEβ16Dec 19, 2024Updated last year
- IJCB 2023: Towards Generalizable Morph Attack Detection via Consistency Regularizationβ13May 1, 2024Updated 2 years ago
- A comprehensive template for aligning large language models (LLMs) using Reinforcement Learning from Human Feedback (RLHF), transfer learβ¦β16Dec 20, 2024Updated last year
- FlameFinder: Illuminating Obscured Fire through Smoke with DML-guided Detectionβ12Jun 26, 2023Updated 2 years ago
- Public repository for the Colosseum Young Gladiators Workshop School of 2023β11Jun 6, 2023Updated 2 years ago
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakesβ11Jan 12, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Exploring the fundamentals and advanced concepts of Large Language Models (LLMs) through practical implementations and collaborative learβ¦β23Dec 24, 2024Updated last year
- β14Nov 14, 2022Updated 3 years ago
- UAV communication using apprenticeship learning via Inverse Reinforcement Learning (IRL)β21Nov 22, 2024Updated last year
- β12Oct 26, 2022Updated 3 years ago
- Robust Transformer with Locality Inductive Bias and Feature Normalization (JESTECH 2023)β11Jul 14, 2024Updated last year
- A comprehensive template for aligning large language models (LLMs) using Reinforcement Learning from Human Feedback (RLHF), transfer learβ¦β40Dec 15, 2024Updated last year
- TFLearn Implementation of DeXpression architecture. Batch normalization is used instead of LRN. Gives a precision of 99.3 percent, recallβ¦β15Nov 30, 2017Updated 8 years ago
- Includes Final Project (Python), Wireshark Labs, and Theoretical HWsβ13Sep 27, 2021Updated 4 years ago
- Official code for paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models, ICML2024"β28Feb 2, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- β16Oct 9, 2025Updated 7 months ago
- γNeurIPS 2024γThe implementation of LIVE: Learnable In-Context Vector for Visual Question Answering https://arxiv.org/abs/2406.13185β23May 31, 2025Updated 11 months ago
- This repository contains the code for the book chapter "Near-Field Beamforming and Multiplexing Using Extremely Large Aperture Arrays"β19Dec 23, 2022Updated 3 years ago
- Adaptation of vision-language models (CLIP) to downstream tasks using local and global prompts.β53Jul 10, 2025Updated 10 months ago
- User manual, developer documentation, and support for StartOSβ19Nov 16, 2024Updated last year
- A cross-platform, OpenGL terminal emulator.β23Jul 26, 2024Updated last year
- Browser-based, graphical operating system for a personal server.β22Nov 16, 2024Updated last year
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261β13Aug 22, 2021Updated 4 years ago
- Cross-platform Rust rewrite of the GNU coreutilsβ21Oct 14, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most β¦β28Jul 31, 2024Updated last year
- Gossip is a nostr clientβ24Nov 16, 2024Updated last year
- Face editing by e4e, text2stylegan,interfacegan,ganspaceβ72Oct 21, 2022Updated 3 years ago
- This repository contains the code for our ECCV 2022 paper on our "Non-isotropic Probabilistic Take on Proxy-based Deep Metric Learning".β12Dec 6, 2022Updated 3 years ago
- [NeurIPS 2023] Meta-Adapterβ48Nov 21, 2023Updated 2 years ago
- Code/Models for Defending Against Universal Attacks Through Selective Feature Regeneration, CVPR 2020β10Jul 31, 2020Updated 5 years ago
- Flutter Audio Query Pluginβ51Dec 19, 2021Updated 4 years ago