SkalskiP / top-cvpr-2024-papers
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
★660, updated 4 months ago
Related projects
Alternatives and complementary repositories for top-cvpr-2024-papers
- Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024 (★1,371, updated 4 months ago)
- Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials] (★571, updated 8 months ago)
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight) (★304, updated 2 months ago)
- This repository is a curated collection of the most exciting and influential CVPR 2023 papers. 🔥 [Paper + Code] (★639, updated 4 months ago)
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything (★992, updated 3 weeks ago)
- Official repository for "AM-RADIO: Reduce All Domains Into One" (★785, updated this week)
- 🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024 (★134, updated 4 months ago)
- This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face … (★482, updated last week)
- [NeurIPS 2024] Code release for "Segment Anything without Supervision" (★413, updated last month)
- API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series (★770, updated 3 months ago)
- code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction (★385, updated 4 months ago)
- [CVPRW'24] SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap (CVPR24 - CVSports workshop) (★234, updated 2 months ago)
- Hiera: A fast, powerful, and simple hierarchical vision transformer. (★891, updated 8 months ago)
- A curated list of foundation models for vision and language tasks (★827, updated this week)
- This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinf… (★613, updated 3 weeks ago)
- [ICCV 2023] Tracking Anything with Decoupled Video Segmentation (★1,258, updated 3 months ago)
- streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL (★1,381, updated this week)
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2 (★1,045, updated last week)
- (★458, updated this week)
- A curated list of papers that released datasets along with their work (★124, updated 2 weeks ago)
- Recipes for shrinking, optimizing, customizing cutting edge vision models. (★860, updated last month)
- [ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi … (★272, updated 2 weeks ago)
- (★42, updated last year)
- [CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses tha… (★777, updated 5 months ago)
- Tracking Any Point (TAP) (★1,302, updated 2 weeks ago)
- Famous Vision Language Models and Their Architectures (★401, updated 2 months ago)
- SAM with text prompt (★1,690, updated 2 weeks ago)
- A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT (★653, updated 11 months ago)
- (★361, updated 11 months ago)
- Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place! (★979, updated last week)