π€© An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024
β145Jun 13, 2024Updated last year
Alternatives and similar repositories for awesome-cvpr-2024
Users that are interested in awesome-cvpr-2024 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Run SOTA Vision-Language Model Florence-2 on your data!β15Mar 27, 2025Updated last year
- Run optical character recognition with PyTesseract from the FiftyOne App!β11Apr 5, 2024Updated 2 years ago
- Solo plugin to Voxel FiftyOneβ17Nov 30, 2022Updated 3 years ago
- A FiftyOne Plugin that allows you to search across any modality in your videos!β24May 27, 2025Updated 10 months ago
- A curated list of plugins that you can add to your FiftyOne install!β138Updated this week
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [CVPR 2024 Highlight] ImageNet-Dβ47Oct 15, 2024Updated last year
- Code to scrape CVPR website for list of accepted papers, find their arXiv links, extract metadata, and download pdfsβ10Jun 12, 2024Updated last year
- This repository contains the notebooks of the series 'transformers by doing - leaving no rock unturned'β13Sep 24, 2023Updated 2 years ago
- Testbed for multimodal retrieval augmented generation techniques with FiftyOne, LlamaIndex, and Milvusβ21Aug 9, 2024Updated last year
- β10Jul 5, 2024Updated last year
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionalityβ22Oct 8, 2024Updated last year
- Code release for "Understanding Bias in Large-Scale Visual Datasets"β23Dec 4, 2024Updated last year
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"β38Aug 18, 2024Updated last year
- 2021-Spring-Capstone-Design 'μ κΈ°μ°¨ 무μ μΆ©μ λ‘λ΄'β11Nov 9, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Downstream semantic segmentation evaluation of DGInStyle.β25Apr 1, 2024Updated 2 years ago
- Convert datasets from Hugging Face to FiftyOne for Visualizationβ11Mar 15, 2024Updated 2 years ago
- Official code of *Towards Event-oriented Long Video Understanding*β12Jul 26, 2024Updated last year
- β54Jan 17, 2025Updated last year
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)β133Nov 5, 2025Updated 5 months ago
- This repo contains code for the Coursera MOOC Hands-on Data Centric Visual AIβ38Sep 24, 2024Updated last year
- Code repo for "SketchODE: Learning neural sketch representation in continuous time" published in ICLR 2022β11Apr 19, 2022Updated 3 years ago
- LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Drivingβ22Sep 17, 2025Updated 6 months ago
- β11Jan 28, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 2023 Capstone Designβ12Nov 2, 2023Updated 2 years ago
- 'Taeyoung96'μ κ°λ° λΈλ‘κ·Έ μ λλ€. :)β14Mar 16, 2025Updated last year
- Source Code of rTVRA for Hyperspectral Image Reconstruction on Dual-camera Compressive Hyperspectral Imaging Systemβ14Dec 23, 2020Updated 5 years ago
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understandingβ50Jan 14, 2025Updated last year
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingβ113Apr 18, 2024Updated last year
- Personal Project To detect POI using YOLO-NAS & CTLβ18Aug 14, 2023Updated 2 years ago
- [NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understandingβ10Jul 15, 2023Updated 2 years ago
- [TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Studyβ16Nov 22, 2024Updated last year
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?β43Nov 1, 2024Updated last year
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official code for the paper "Exploiting the Complementarity of 2D and 3D Networks to Address Domain-Shift in 3D Semantic Segmentation"β11Aug 25, 2023Updated 2 years ago
- Semantically Search Emojis From the Command Line!β13Nov 26, 2023Updated 2 years ago
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"β33Jul 8, 2025Updated 9 months ago
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMsβ13Dec 28, 2024Updated last year
- Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Featuresβ12Mar 2, 2021Updated 5 years ago
- [CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"β53Jun 16, 2025Updated 9 months ago
- Code release for "Clue Me In: Semi-Supervised FGVC with Out-of-Distribution Data".β13Apr 11, 2022Updated 4 years ago