AMAP-ML / Q-Hawkeye
☆51 · Updated this week
Alternatives and similar repositories for Q-Hawkeye
Users who are interested in Q-Hawkeye are comparing it to the repositories listed below.
- Eevee: Towards Close-up High-resolution Video-based Virtual Try-on ☆67 · Updated last month
- ☆53 · Updated last month
- [AAAI2026] ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints ☆54 · Updated 3 months ago
- ☆24 · Updated last year
- Official implementation of ICLR 2026 paper "Urban Socio-Semantic Segmentation with Vision-Language Reasoning" ☆155 · Updated 2 weeks ago
- SEED Dataset ☆28 · Updated 8 months ago
- ☆21 · Updated last year
- [ICLR 2026] Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation ☆116 · Updated last week
- [ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models ☆124 · Updated 3 months ago
- [ICLR2026] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models ☆108 · Updated last week
- [ICLR26] NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation Models ☆111 · Updated 6 months ago
- Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model. ☆77 · Updated 7 months ago
- Video Reasoning Segmentation ☆28 · Updated last year
- [ICLR 2024 Spotlight] Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments ☆20 · Updated 5 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction ☆201 · Updated 2 years ago
- [ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption ☆104 · Updated 2 years ago
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing ☆23 · Updated 5 months ago
- HEtero-Assists Distillation for Heterogeneous Object Detectors ☆10 · Updated 2 years ago
- [AAAI 2022] Pytorch implementation of "LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization". ☆22 · Updated 3 years ago
- ☆29 · Updated 8 months ago
- ☆27 · Updated 9 months ago
- Official repository for Scone (Subject-driven Composition and Distinction Enhancement) model, designed to support multi-subject compositi… ☆28 · Updated 3 weeks ago
- [ICCV2025] Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ☆186 · Updated 8 months ago
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607) ☆97 · Updated 6 months ago
- ☆22 · Updated 2 years ago
- ☆13 · Updated last year
- ☆53 · Updated 2 years ago
- Official repo for 【TLCM: Training-efficient Latent Consistency Model for Image Generation with 2-8 Steps】 ☆36 · Updated last year
- Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization ☆149 · Updated this week
- [NIPS2023] This is an official implementation of paper "DAC-DETR: Divide the Attention Layers and Conquer". ☆65 · Updated last year