SkalskiP / top-cvpr-2024-papers
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
★738 · Updated 2 months ago
Alternatives and similar repositories for top-cvpr-2024-papers
Users interested in top-cvpr-2024-papers are comparing it to the repositories listed below.
- This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo] ★762 · Updated 2 months ago
- Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024 ★1,563 · Updated last year
- Official repository for "AM-RADIO: Reduce All Domains Into One" ★1,313 · Updated 2 weeks ago
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight) ★358 · Updated 11 months ago
- 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials] ★628 · Updated last year
- This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face … ★681 · Updated 2 weeks ago
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything ★1,331 · Updated 3 months ago
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series ★1,014 · Updated 7 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2023 papers. 🔥 [Paper + Code] ★656 · Updated 2 months ago
- Hiera: A fast, powerful, and simple hierarchical vision transformer. ★1,012 · Updated last year
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More! ★1,526 · Updated last week
- 🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024 ★144 · Updated last year
- Efficient Track Anything ★620 · Updated 7 months ago
- A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT ★789 · Updated last year
- 4M: Massively Multimodal Masked Modeling ★1,760 · Updated 2 months ago
- This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models. ★1,120 · Updated 7 months ago
- code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction ★423 · Updated last year
- [NeurIPS 2024] Code release for "Segment Anything without Supervision" ★479 · Updated 2 months ago
- [CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation" ★305 · Updated last month
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects. ★1,357 · Updated 2 weeks ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. ★1,571 · Updated last week
- ★524 · Updated 9 months ago
- [CVPRW'24] SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap (CVPR24 - CVSports workshop) ★321 · Updated 3 months ago
- This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinf… ★1,024 · Updated 9 months ago
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding ★1,178 · Updated last month
- Tracking Any Point (TAP) ★1,636 · Updated 3 weeks ago
- [ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention ★870 · Updated last month
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2 ★2,622 · Updated 2 months ago
- [ICCV 2023] Tracking Anything with Decoupled Video Segmentation ★1,422 · Updated 3 months ago
- LightlyTrain is the first PyTorch framework to pretrain computer vision models on unlabeled data for industrial applications ★778 · Updated this week