SarahRastegar / Best-Papers-Top-VenuesLinks
Best Papers of Top Venues like CVPR, NeurIPS, ICLR, ICML, ICCV, ECCV, ...
☆250Updated last week
Alternatives and similar repositories for Best-Papers-Top-Venues
Users that are interested in Best-Papers-Top-Venues are comparing it to the libraries listed below
Sorting:
- official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"☆234Updated 6 months ago
- Open source implementation of "Vision Transformers Need Registers"☆201Updated 2 months ago
- A curated list of awesome self-supervised learning methods in videos☆158Updated 2 weeks ago
- A curated list of awesome Multimodal studies.☆301Updated 3 weeks ago
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆246Updated last year
- Visualizing the attention of vision-language models☆265Updated 9 months ago
- This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation …☆507Updated 9 months ago
- [T-PAMI] A curated list of self-supervised multimodal learning resources.☆272Updated last year
- Optimizing the way of contrastive learning in PyTorch-DDP(DistributedDataParallel) multi-GPU training☆35Updated last year
- A paper list for spatial reasoning☆521Updated last week
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs☆57Updated 10 months ago
- ☆13Updated 11 months ago
- [NeurIPS 2024, spotlight] Scaling Out-of-Distribution Detection for Multiple Modalities☆68Updated 2 weeks ago
- Processed / Cleaned Data for Paper Copilot☆781Updated 2 weeks ago
- [Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)☆351Updated 7 months ago
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning☆168Updated 3 years ago
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models☆107Updated last year
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)☆94Updated last year
- 🌐 Permanent Hosting Site: http://ai-paper-finder.info/ 🌐 Hugging Face Hosting: https://huggingface.co/spaces/wenhanacademia/ai-paper-f…☆248Updated this week
- A curated list of Continual Learning papers and BibTeX entries☆200Updated last year
- Code for the paper "Compositional Entailment Learning for Hyperbolic Vision-Language Models".☆91Updated 6 months ago
- Sparse Linear Concept Embeddings☆126Updated 8 months ago
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆44Updated 3 months ago
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations☆125Updated 3 months ago
- A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.☆409Updated last year
- This repository collects papers on VLLM applications. We will update new papers irregularly.☆188Updated 3 months ago
- CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification☆105Updated last year
- ☆263Updated last year
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion☆104Updated last year
- Official code for the paper "Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-…☆21Updated 7 months ago