π€© An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024
β145Jun 13, 2024Updated last year
Alternatives and similar repositories for awesome-cvpr-2024
Users that are interested in awesome-cvpr-2024 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Use text-to-image models Stable Diffusion, DALL-E2, DALL-E3, SDXL, SSD-1B, Kandinsky-2.2, and LCM from UI. Add images directly to your daβ¦β33Apr 23, 2024Updated last year
- Run SOTA Vision-Language Model Florence-2 on your data!β15Mar 27, 2025Updated 11 months ago
- Albumentations Data Augmentation Plugin for FiftyOne!β14Aug 22, 2024Updated last year
- My journey during 10 weeks of building FiftyOne pluginsβ22Nov 12, 2023Updated 2 years ago
- β102Nov 25, 2025Updated 3 months ago
- [NeurIPS24] VisMin: Visual Minimal-Change Understandingβ19Mar 3, 2025Updated last year
- Solo plugin to Voxel FiftyOneβ17Nov 30, 2022Updated 3 years ago
- FiftyOne plugin for comparing object detection modelsβ14Nov 6, 2023Updated 2 years ago
- FiftyOne Plugin for finding common image quality issuesβ35Oct 21, 2024Updated last year
- A FiftyOne Plugin that allows you to search across any modality in your videos!β23May 27, 2025Updated 9 months ago
- Perform visual question answering on your imagesβ19May 8, 2024Updated last year
- A curated list of plugins that you can add to your FiftyOne install!β136Updated this week
- Run zero-shot prediction models on your dataβ37Dec 19, 2024Updated last year
- A collection of fine-tuning notebooks!β31Oct 5, 2023Updated 2 years ago
- [CVPR 2024 Highlight] ImageNet-Dβ47Oct 15, 2024Updated last year
- Code to scrape CVPR website for list of accepted papers, find their arXiv links, extract metadata, and download pdfsβ10Jun 12, 2024Updated last year
- Testbed for multimodal retrieval augmented generation techniques with FiftyOne, LlamaIndex, and Milvusβ21Aug 9, 2024Updated last year
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionalityβ21Oct 8, 2024Updated last year
- Code release for "Understanding Bias in Large-Scale Visual Datasets"β23Dec 4, 2024Updated last year
- 2021-Spring-Capstone-Design 'μ κΈ°μ°¨ 무μ μΆ©μ λ‘λ΄'β11Nov 9, 2021Updated 4 years ago
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)β37Nov 12, 2025Updated 4 months ago
- Downstream semantic segmentation evaluation of DGInStyle.β25Apr 1, 2024Updated last year
- Official code of *Towards Event-oriented Long Video Understanding*β12Jul 26, 2024Updated last year
- This repo contains code for the Coursera MOOC Hands-on Data Centric Visual AIβ38Sep 24, 2024Updated last year
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)β132Nov 5, 2025Updated 4 months ago
- Code repo for "SketchODE: Learning neural sketch representation in continuous time" published in ICLR 2022β11Apr 19, 2022Updated 3 years ago
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understandingβ56Apr 7, 2025Updated 11 months ago
- LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Drivingβ22Sep 17, 2025Updated 6 months ago
- 2023 Capstone Designβ12Nov 2, 2023Updated 2 years ago
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understandingβ50Jan 14, 2025Updated last year
- [BMVC 2023 (Oral)] SketchDreamer: Interactive Text-Augmented Creative Sketch Ideationβ27Jun 8, 2025Updated 9 months ago
- β18Sep 25, 2024Updated last year
- Personal Project To detect POI using YOLO-NAS & CTLβ18Aug 14, 2023Updated 2 years ago
- [NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understandingβ10Jul 15, 2023Updated 2 years ago
- [TACL] Do Vision and Language Models Share Concepts? A Vector Space Alignment Studyβ16Nov 22, 2024Updated last year
- Official code for the paper "Exploiting the Complementarity of 2D and 3D Networks to Address Domain-Shift in 3D Semantic Segmentation"β11Aug 25, 2023Updated 2 years ago
- Semantically Search Emojis From the Command Line!β13Nov 26, 2023Updated 2 years ago
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"β33Jul 8, 2025Updated 8 months ago
- Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Featuresβ12Mar 2, 2021Updated 5 years ago