π€© An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024
β145Jun 13, 2024Updated last year
Alternatives and similar repositories for awesome-cvpr-2024
Users that are interested in awesome-cvpr-2024 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Run SOTA Vision-Language Model Florence-2 on your data!β15Mar 27, 2025Updated last year
- Albumentations Data Augmentation Plugin for FiftyOne!β15Aug 22, 2024Updated last year
- Run optical character recognition with PyTesseract from the FiftyOne App!β11Apr 5, 2024Updated 2 years ago
- My journey during 10 weeks of building FiftyOne pluginsβ22Nov 12, 2023Updated 2 years ago
- [NeurIPS24] VisMin: Visual Minimal-Change Understandingβ19Mar 3, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Solo plugin to Voxel FiftyOneβ17Nov 30, 2022Updated 3 years ago
- FiftyOne Plugin for finding common image quality issuesβ35Oct 21, 2024Updated last year
- A FiftyOne Plugin that allows you to search across any modality in your videos!β25May 27, 2025Updated 11 months ago
- Run zero-shot prediction models on your dataβ37Dec 19, 2024Updated last year
- Code to scrape CVPR website for list of accepted papers, find their arXiv links, extract metadata, and download pdfsβ10Jun 12, 2024Updated last year
- This repository contains the notebooks of the series 'transformers by doing - leaving no rock unturned'β13Sep 24, 2023Updated 2 years ago
- Testbed for multimodal retrieval augmented generation techniques with FiftyOne, LlamaIndex, and Milvusβ21Aug 9, 2024Updated last year
- β10Jul 5, 2024Updated last year
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionalityβ22Oct 8, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Convert datasets from Hugging Face to FiftyOne for Visualizationβ11Mar 15, 2024Updated 2 years ago
- β54Jan 17, 2025Updated last year
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)β133Nov 5, 2025Updated 5 months ago
- This repo contains code for the Coursera MOOC Hands-on Data Centric Visual AIβ38Sep 24, 2024Updated last year
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understandingβ56Apr 7, 2025Updated last year
- Code repo for "SketchODE: Learning neural sketch representation in continuous time" published in ICLR 2022β11Apr 19, 2022Updated 4 years ago
- LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Drivingβ22Sep 17, 2025Updated 7 months ago
- β11Jan 28, 2026Updated 3 months ago
- 2023 Capstone Designβ12Nov 2, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 'Taeyoung96'μ κ°λ° λΈλ‘κ·Έ μ λλ€. :)β14Mar 16, 2025Updated last year
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understandingβ50Jan 14, 2025Updated last year
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingβ113Apr 18, 2024Updated 2 years ago
- β17Sep 25, 2024Updated last year
- Personal Project To detect POI using YOLO-NAS & CTLβ18Aug 14, 2023Updated 2 years ago
- [NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understandingβ10Jul 15, 2023Updated 2 years ago
- [TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Studyβ16Nov 22, 2024Updated last year
- Semantically Search Emojis From the Command Line!β13Nov 26, 2023Updated 2 years ago
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"β33Jul 8, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMsβ13Dec 28, 2024Updated last year
- Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"β14Feb 21, 2024Updated 2 years ago
- [CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"β52Jun 16, 2025Updated 10 months ago
- Code release for "Clue Me In: Semi-Supervised FGVC with Out-of-Distribution Data".β13Apr 11, 2022Updated 4 years ago
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many daβ¦β15Mar 28, 2025Updated last year
- β11Dec 13, 2023Updated 2 years ago
- K-FACE Analysis Project on Pytorchβ11Sep 6, 2021Updated 4 years ago