SkalskiP / top-cvpr-2024-papers
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. π₯ [Paper + Code + Demo]
β697Updated 7 months ago
Alternatives and similar repositories for top-cvpr-2024-papers:
Users that are interested in top-cvpr-2024-papers are comparing it to the libraries listed below
- Official repository for "AM-RADIO: Reduce All Domains Into One"β913Updated last week
- Official Implementation of CVPR24 highligt paper: Matching Anything by Segmenting Anythingβ1,203Updated 3 months ago
- Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024β1,454Updated 7 months ago
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)β335Updated 5 months ago
- 4M: Massively Multimodal Masked Modelingβ1,685Updated this week
- ποΈ + π¬ + π§ = π€ Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]β601Updated 11 months ago
- π€© An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024β142Updated 8 months ago
- Hiera: A fast, powerful, and simple hierarchical vision transformer.β950Updated 11 months ago
- Famous Vision Language Models and Their Architecturesβ635Updated last week
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Seriesβ894Updated last month
- [NeurIPS 2024] Code release for "Segment Anything without Supervision"β447Updated 4 months ago
- β497Updated 3 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. πβ1,187Updated 2 weeks ago
- This repository is a curated collection of the most exciting and influential CVPR 2023 papers. π₯ [Paper + Code]β646Updated 7 months ago
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2β1,671Updated 2 months ago
- code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Predictionβ402Updated 8 months ago
- Images to inference with no labeling (use foundation models to train supervised models).β2,128Updated 2 months ago
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.β1,182Updated 2 months ago
- [CVPR 2024 π₯] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses thaβ¦β828Updated 2 months ago
- This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face β¦β565Updated last month
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understandingβ860Updated last month
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"β360Updated last month
- Efficient vision foundation models for high-resolution generation and perception.β2,641Updated 3 weeks ago
- ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expertβ¦β1,350Updated 2 months ago
- [CVPRW'24] SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap (CVPR24 - CVSports workshop)β273Updated last week
- Official Pytorch Implementation for βDINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Videoβ (ECCV 2024)β464Updated 2 months ago
- A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRTβ708Updated last year
- A curated list of foundation models for vision and language tasksβ937Updated last week
- [ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi β¦β292Updated 2 months ago
- Open source AI/ML capabilities for the FiftyOne ecosystemβ137Updated this week