cvpr-org / author-kit
☆481 Updated last month
Alternatives and similar repositories for author-kit:
Users who are interested in author-kit are comparing it to the repositories listed below
- Extended LaTeX template for CVPR/ICCV papers ☆576 Updated 2 months ago
- ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in co… ☆952 Updated 7 months ago
- ☆240 Updated 11 months ago
- This is the official code release for our work, Denoising Vision Transformers. ☆360 Updated 5 months ago
- CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest d… ☆449 Updated 9 months ago
- ☆252 Updated last year
- Open source implementation of "Vision Transformers Need Registers" ☆175 Updated 2 weeks ago
- This repository categorizes the papers about diffusion models applied in computer vision according to their target task. The classificatio… ☆394 Updated last year
- PyTorch implementation of RCG https://arxiv.org/abs/2312.03701 ☆912 Updated 6 months ago
- [CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want ☆809 Updated 8 months ago
- Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral) ☆439 Updated 2 years ago
- [ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions ☆1,357 Updated last year
- ☆511 Updated 5 months ago
- [ICLR 2023 Oral] Image as Set of Points ☆564 Updated 11 months ago
- [ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding ☆940 Updated 9 months ago
- ❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119 ☆1,104 Updated last year
- Official Open Source code for "Scaling Language-Image Pre-training via Masking" ☆420 Updated 2 years ago
- ☆179 Updated 9 months ago
- Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence" ☆302 Updated last year
- Curated list of video object segmentation (VOS) papers, datasets, and projects. ☆311 Updated this week
- [ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer" ☆315 Updated 4 months ago
- A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.). ☆820 Updated 9 months ago
- Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners" ☆336 Updated 5 months ago
- A Collection of Papers and Codes for CVPR2025/CVPR2024/ECCV2024 AIGC ☆540 Updated last month
- [ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to do… ☆524 Updated last year
- ✨✨Latest Papers on Vision Mamba and Related Areas ☆324 Updated last week
- [NeurIPS'23] Emergent Correspondence from Image Diffusion ☆681 Updated 11 months ago
- Open-vocabulary Semantic Segmentation ☆342 Updated 6 months ago
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision. ☆186 Updated last year
- [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models ☆707 Updated 2 weeks ago