harpreetsahota204 / awesome-cvpr-2024
π€© An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024
β142Updated 10 months ago
Alternatives and similar repositories for awesome-cvpr-2024:
Users that are interested in awesome-cvpr-2024 are comparing it to the libraries listed below
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.β243Updated 5 months ago
- [ECCV2024 Oralπ₯] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"β345Updated 3 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Modelsβ301Updated 9 months ago
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.β186Updated last year
- Object Recognition as Next Token Prediction (CVPR 2024 Highlight)β175Updated 3 months ago
- β201Updated last year
- 1-shot image segmentation using Stable Diffusionβ137Updated last year
- [NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"β286Updated last year
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"β123Updated 7 months ago
- [CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloadinβ¦β219Updated 6 months ago
- This is the official code release for our work, Denoising Vision Transformers.β360Updated 5 months ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anythingβ236Updated this week
- [CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Promptsβ318Updated 9 months ago
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)β118Updated last year
- Learning from synthetic data - code and modelsβ314Updated last year
- Scaling Vision Pre-Training to 4K Resolutionβ119Updated 3 weeks ago
- [Fully open] [Encoder-free MLLM] Vision as LoRAβ95Updated last week
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of β¦β67Updated last year
- β173Updated 6 months ago
- β507Updated 5 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuningβ273Updated last month
- Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmarkβ112Updated this week
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long β¦β86Updated 11 months ago
- [CVPR24] Official Implementation of GEM (Grounding Everything Module)β114Updated last week
- β179Updated last week
- [ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"