An official implementation of "GOAL⚽: Global-local Object Alignment Learning" (CVPR 2025).
☆27Aug 14, 2025Updated 7 months ago
Alternatives and similar repositories for GOAL
Users that are interested in GOAL are comparing it to the libraries listed below
Sorting:
- ☆14Jul 1, 2025Updated 8 months ago
- code space of paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025)☆22Apr 26, 2025Updated 10 months ago
- 🧊 R3eVision: A Survey on Robust Rendering, Restoration, and Enhancement for 3D Low-Level Vision☆40Feb 5, 2026Updated last month
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆35Nov 19, 2025Updated 4 months ago
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)☆37Nov 12, 2025Updated 4 months ago
- ☆10Dec 16, 2023Updated 2 years ago
- [AAAI'25 Oral] NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark☆11Jun 10, 2025Updated 9 months ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆22Nov 8, 2023Updated 2 years ago
- [ICCV 2025] Repository for A Quality-Guided Mixture of Score-fusion Experts Framework for Human Recognition☆16Sep 29, 2025Updated 5 months ago
- Unsupervised Lifelong Person Re-identification via Contrastive Rehearsal☆11Apr 7, 2022Updated 3 years ago
- [TAC 2024] SVFAP: Self-supervised Video Facial Affect Perceiver☆23Sep 25, 2024Updated last year
- [CVPR 2025] Official implementation of paper "Multi-Granularity Class Prototype Topology Distillation for Class-Incremental Source-Free …☆17Aug 26, 2025Updated 6 months ago
- 3D LUTs for Real Time sRGB White-Balance Correction☆13Dec 14, 2023Updated 2 years ago
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…☆12May 15, 2024Updated last year
- [NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning☆32Dec 9, 2025Updated 3 months ago
- Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition☆14Dec 22, 2022Updated 3 years ago
- This repo consists of my implementation of DocFormerV2☆11Mar 31, 2024Updated last year
- ☆28Nov 11, 2025Updated 4 months ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- The official repository for Dynamic Clustering and Cluster Contrastive Learning (DCCC).☆14Dec 15, 2023Updated 2 years ago
- [CVPR 2025 HIghlight] XLRS-Bench: ould Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆56Oct 31, 2025Updated 4 months ago
- Code for our Source-free Unsupervised Video Domain Adaptation Paper☆13Jan 17, 2025Updated last year
- The code of MGCC: Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning☆20Feb 26, 2025Updated last year
- ☆13Feb 18, 2025Updated last year
- [CVPR 2024] CA-Jaccard: Camera-aware Jaccard Distance for Person Re-identification☆24Oct 28, 2024Updated last year
- Deep Learning papers that enlightened me☆12Dec 22, 2017Updated 8 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 10 months ago
- official implementation for ECCV 2024 paper "Prioritized Semantic Learning for Zero-shot Instance Navigation"☆45Jun 6, 2025Updated 9 months ago
- DeepEarth: AI Foundation Model for Planetary Science & Sustainability☆26Mar 10, 2026Updated last week
- codes for paper "AttCAT: Explaining Transformers via Attentive Class Activation Tokens"☆12May 13, 2024Updated last year
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆11Oct 20, 2023Updated 2 years ago
- ☆13Sep 7, 2023Updated 2 years ago
- ☆17Mar 31, 2024Updated last year
- UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation☆37Nov 24, 2025Updated 3 months ago
- ☆62Sep 2, 2024Updated last year
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆20Nov 4, 2024Updated last year
- Official repository of "FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring"☆78Dec 5, 2025Updated 3 months ago
- SOD-YOLO (Small Object Detection YOLO) builds upon the foundational YOLOv8 model to address the unique challenges of detecting small obje…☆34Jun 16, 2024Updated last year
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆67Feb 15, 2025Updated last year