yafeng19 / T-CORELinks
[CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation Learning".
☆15Updated 5 months ago
Alternatives and similar repositories for T-CORE
Users that are interested in T-CORE are comparing it to the libraries listed below
Sorting:
- ☆16Updated last year
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation☆33Updated last year
- The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…☆14Updated 4 months ago
- The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation☆21Updated 2 weeks ago
- Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”☆12Updated last year
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆132Updated 8 months ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆87Updated 7 months ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆46Updated last year
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆188Updated last year
- [CVPR 2025 🔥]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos☆78Updated 4 months ago
- ☆59Updated last year
- [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training☆28Updated 5 months ago
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024☆46Updated 8 months ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆53Updated 7 months ago
- [CVPR 2025 Highlight] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆27Updated this week
- [ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation☆135Updated 2 months ago
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations☆96Updated this week
- Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].☆32Updated 9 months ago
- [ICCV 2025] Official PyTorch Code for "Advancing Textual Prompt Learning with Anchored Attributes"☆89Updated this week
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning☆40Updated 4 months ago
- [AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆24Updated 8 months ago
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference☆168Updated 10 months ago
- Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆54Updated 3 months ago
- A list of referring video object segmentation papers☆49Updated 2 months ago
- Easy wrapper for inserting LoRA layers in CLIP.☆35Updated last year
- Code for paper "LLMs Can Evolve Continually on Modality for X-Modal Reasoning" NeurIPS2024☆37Updated 8 months ago
- Code for the paper "Compositional Entailment Learning for Hyperbolic Vision-Language Models".☆80Updated 2 months ago
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos☆24Updated last year
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆16Updated 10 months ago
- Official PyTorch code of GroundVQA (CVPR'24)☆62Updated 11 months ago