HELLORPG / CV-Framework
A simple computer vision framework, mainly based on PyTorch, including distributed training, logging, and so on.
☆12Updated last year
Alternatives and similar repositories for CV-Framework
Users who are interested in CV-Framework are comparing it to the libraries listed below.
- A Fine-grained Benchmark for Video Captioning and Retrieval☆15Updated 2 months ago
- [CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection☆25Updated 8 months ago
- [ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding☆21Updated last year
- Awesome video instance segmentation papers☆40Updated 2 weeks ago
- Multi-Granularity Language-Guided Multi-Object Tracking☆17Updated last week
- [CVPR 2023 Highlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos☆33Updated last year
- History-Aware Transformation of ReID Features for Multiple Object Tracking☆17Updated 3 weeks ago
- [CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos☆11Updated 11 months ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆35Updated last year
- ☆19Updated 10 months ago
- [WACV2025] Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)☆79Updated 11 months ago
- [NeurIPS 2024] Repository for the paper "OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking".☆22Updated 6 months ago
- A list of referring video object segmentation papers☆39Updated last week
- Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆23Updated 5 months ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Updated last year
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Updated last year
- A visualization tool for temporal action localization (detection/segmentation).☆12Updated 2 years ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆79Updated 4 months ago
- High Quality Video Reasoning Segmentation☆23Updated last month
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆35Updated last year
- Transactions on Multimedia (TMM25)☆14Updated last month
- Fast and general video object segmentation evaluation.☆31Updated last year
- Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆48Updated 2 weeks ago
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution☆50Updated 3 months ago
- [CVPR 2024] Context-Guided Spatio-Temporal Video Grounding☆54Updated 11 months ago
- Tracking with Human-Intent Reasoning☆71Updated 7 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆68Updated 7 months ago
- [CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models☆18Updated last year
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆20Updated 3 months ago
- Official repository of the CVPR 2024 paper "RMem: Restricted Memory Banks Improve Video Object Segmentation"☆45Updated 4 months ago