gorkaydemir / SOLV
[NeurIPS 2023] Self-supervised Object-Centric Learning for Videos
☆32 · Nov 28, 2024 · Updated last year
Alternatives and similar repositories for SOLV
Users who are interested in SOLV are comparing it to the repositories listed below
- Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities" ☆33 · Feb 12, 2025 · Updated last year
- [ICLR 2023 - UNOFFICIAL] Bridging the Gap to Real-World Object-Centric Learning ☆23 · May 10, 2024 · Updated last year
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers ☆73 · Jun 11, 2024 · Updated last year
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number] ☆65 · Jan 25, 2025 · Updated last year
- PyTorch implementation of MED-VT: Multiscale Encoder-Decoder Video Transformer with Application to Object Segmentation ☆27 · Oct 22, 2024 · Updated last year
- [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation ☆64 · Dec 23, 2024 · Updated last year
- ☆180 · Feb 3, 2023 · Updated 3 years ago
- [NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking ☆13 · May 3, 2024 · Updated last year
- ☆88 · Aug 13, 2025 · Updated 6 months ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation ☆34 · Mar 7, 2025 · Updated 11 months ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model' ☆33 · Nov 7, 2023 · Updated 2 years ago
- [NeurIPS 2023 (Spotlight)] Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts ☆13 · Jan 30, 2024 · Updated 2 years ago
- ☆18 · Mar 1, 2024 · Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning ☆41 · Sep 25, 2023 · Updated 2 years ago
- ☆22 · Mar 7, 2025 · Updated 11 months ago
- [CVPR 2024] Action-slot: Visual Action-centric Representations for Atomic Activity Recognition in Traffic Scenes ☆24 · Apr 28, 2025 · Updated 9 months ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space" ☆20 · Apr 20, 2023 · Updated 2 years ago
- Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation". ☆26 · Mar 26, 2024 · Updated last year
- Visual Representation Learning with Stochastic Frame Prediction (ICML 2024) ☆26 · Nov 27, 2024 · Updated last year
- This is the project for 'USG'. ☆35 · Apr 7, 2025 · Updated 10 months ago
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing ☆25 · Dec 8, 2024 · Updated last year
- Official Pytorch Implementation of the framework TEMPURA proposed in our paper Unbiased Scene Graph Generation in Videos accepted by CVPR… ☆24 · Sep 9, 2025 · Updated 5 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models ☆94 · Jan 16, 2024 · Updated 2 years ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision" ☆29 · Apr 16, 2024 · Updated last year
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models ☆120 · Sep 20, 2023 · Updated 2 years ago
- [ICCV2023] MBPTrack: Improving 3D Point Cloud Tracking with Memory Networks and Box Priors ☆29 · Sep 5, 2023 · Updated 2 years ago
- "Tail-Aware Sperm Analysis for Transparent Tracking of Spermatozoa" Official Implementation ☆10 · Jan 21, 2026 · Updated 3 weeks ago
- Code for paper 'Leveraging Predicate and Triplet Learning for Scene Graph Generation'. (CVPR 2024) ☆32 · Sep 6, 2025 · Updated 5 months ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc… ☆12 · Oct 14, 2024 · Updated last year
- [CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation ☆29 · Apr 28, 2025 · Updated 9 months ago
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024) ☆32 · Mar 29, 2024 · Updated last year
- Fast and general video object segmentation evaluation. ☆36 · Jan 30, 2024 · Updated 2 years ago
- Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown ☆41 · Dec 25, 2024 · Updated last year
- DisTime: Distribution-based Time Representation for Video Large Language Models. ☆18 · Jul 10, 2025 · Updated 7 months ago
- Official implementation of Neuronal Time-Invariant Representations (NeuPRINT), NeurIPS 2023 ☆10 · Apr 17, 2024 · Updated last year
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking ☆13 · Apr 12, 2023 · Updated 2 years ago
- (NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models" ☆35 · Mar 22, 2025 · Updated 10 months ago
- The repository of VG-Refiner paper ☆17 · Dec 9, 2025 · Updated 2 months ago
- This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes … ☆88 · Jan 9, 2023 · Updated 3 years ago