vlfom / RNCDL
[NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects".
☆108Updated last year
Related projects ⓘ
Alternatives and complementary repositories for RNCDL
- Open-source code for Generic Grouping Network (GGN, CVPR 2022)☆109Updated 6 months ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆47Updated 11 months ago
- Pytorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut"☆56Updated last year
- ☆54Updated 2 years ago
- PyTorch implementation of R-MAE https//arxiv.org/abs/2306.05411☆109Updated last year
- ☆159Updated last year
- ☆57Updated last year
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆99Updated 2 years ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆95Updated last month
- PyTorch Implementation of Region Similarity Representation Learning (ReSim)☆86Updated 3 years ago
- (ICLR 2024, CVPR 2024) SparseFormer☆62Updated 7 months ago
- ☆41Updated 2 years ago
- [ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation☆97Updated last year
- ☆99Updated 4 months ago
- [CVPRW'23] The official PyTorch implementation of NamedMask☆24Updated last year
- Official code for "Opening up Open World Tracking" (CVPR 2022)☆54Updated last year
- [ECCV 2022] Is Appearance Free Action Recognition Possible?☆58Updated 7 months ago
- ☆57Updated last year
- A task-agnostic vision-language architecture as a step towards General Purpose Vision☆92Updated 3 years ago
- [NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training☆64Updated this week
- Code release for "Language-conditioned Detection Transformer"☆84Updated 4 months ago
- Official repository for the General Robust Image Task (GRIT) Benchmark☆50Updated last year
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆110Updated 2 months ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆52Updated 4 months ago
- Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".☆78Updated last year
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆80Updated 3 months ago
- This is the official released code for our paper, The Emergence of Objectness: Learning Zero-Shot Segmentation from Videos, which has bee…☆51Updated last year
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆84Updated 2 years ago
- Large-Vocabulary Video Instance Segmentation dataset☆76Updated 4 months ago
- ☆34Updated 2 years ago