harpreetsahota204 / awesome-cvpr-2024View external linksLinks
π€© An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024
β144Jun 13, 2024Updated last year
Alternatives and similar repositories for awesome-cvpr-2024
Users that are interested in awesome-cvpr-2024 are comparing it to the libraries listed below
Sorting:
- Albumentations Data Augmentation Plugin for FiftyOne!β14Aug 22, 2024Updated last year
- Use text-to-image models Stable Diffusion, DALL-E2, DALL-E3, SDXL, SSD-1B, Kandinsky-2.2, and LCM from UI. Add images directly to your daβ¦β33Apr 23, 2024Updated last year
- Run SOTA Vision-Language Model Florence-2 on your data!β15Mar 27, 2025Updated 10 months ago
- β96Nov 25, 2025Updated 2 months ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"β22Dec 4, 2024Updated last year
- [CVPR 2024 Highlight] ImageNet-Dβ46Oct 15, 2024Updated last year
- β10Jul 5, 2024Updated last year
- A collection of fine-tuning notebooks!β30Oct 5, 2023Updated 2 years ago
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)β34Nov 12, 2025Updated 3 months ago
- A curated list of plugins that you can add to your FiftyOne install!β136Updated this week
- GoLU, a novel, self-gated and element-wise activation function that performs well over a diverse set of tasksβ24Oct 4, 2025Updated 4 months ago
- This repository contains the notebooks of the series 'transformers by doing - leaving no rock unturned'β13Sep 24, 2023Updated 2 years ago
- β54Jan 17, 2025Updated last year
- β10Aug 22, 2022Updated 3 years ago
- FiftyOne Plugin for finding common image quality issuesβ34Oct 21, 2024Updated last year
- This repo contains code for the Coursera MOOC Hands-on Data Centric Visual AIβ38Sep 24, 2024Updated last year
- [TACL] Do Vision and Language Models Share Concepts? A Vector Space Alignment Studyβ16Nov 22, 2024Updated last year
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?β42Nov 1, 2024Updated last year
- Run zero-shot prediction models on your dataβ36Dec 19, 2024Updated last year
- 'Taeyoung96'μ κ°λ° λΈλ‘κ·Έ μ λλ€. :)β14Mar 16, 2025Updated 11 months ago
- LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Drivingβ22Sep 17, 2025Updated 5 months ago
- Solo plugin to Voxel FiftyOneβ17Nov 30, 2022Updated 3 years ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionalityβ21Oct 8, 2024Updated last year
- 2021-Spring-Capstone-Design 'μ κΈ°μ°¨ 무μ μΆ©μ λ‘λ΄'β11Nov 9, 2021Updated 4 years ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.β19Jun 27, 2024Updated last year
- A FiftyOne Plugin that allows you to search across any modality in your videos!β23May 27, 2025Updated 8 months ago
- SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024)β18Apr 28, 2024Updated last year
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)β130Nov 5, 2025Updated 3 months ago
- Python library and cmd tool to backup API callsβ18Nov 14, 2025Updated 3 months ago
- Implementation of the semi-structured inference model in our ACL 2020 paper, INFOTABS: Inference on Tables as Semi-structured Data.β18Dec 7, 2021Updated 4 years ago
- Understanding Self-Supervised Learning in a non-IID Settingβ21Oct 21, 2022Updated 3 years ago
- [CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"β50Jun 16, 2025Updated 8 months ago
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding