jqtangust / hawkLinks
π₯ π₯ π₯ [NeurIPS 2024] Official Implementation of Hawk: Learning to Understand Open-World Video Anomalies
β211Updated 3 months ago
Alternatives and similar repositories for hawk
Users that are interested in hawk are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"β209Updated last year
- (ICCV-2025 Official Code)) Improving Generalist Model with Domain-Specific Expertsβ86Updated 2 weeks ago
- CoS: Chain-of-Shot Prompting for Long Video Understandingβ48Updated 5 months ago
- [CVPR 2025 Highlight] Official code for "Olympus: A Universal Task Router for Computer Vision Tasks"β428Updated last month
- [NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategyβ68Updated 5 months ago
- π¦ Yo'Chameleon: Your Personalized Chameleon (CVPR 2025)β140Updated 2 months ago
- Official repository of MMGenBenchβ121Updated 4 months ago
- [Arxiv 2024] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptationβ66Updated last year
- CVPR2025β41Updated 3 months ago
- [ACM MM'2024] Official repository for "Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval"β39Updated 6 months ago
- [NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportationβ103Updated 9 months ago
- Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Depβ¦β17Updated last month
- Evaluation of Text-to-Video Generation Models: A Dynamics Perspective[NeurIPS 2024].β275Updated 7 months ago
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modelingβ79Updated 4 months ago
- Efficient DiT architecture for text2any tasks, ICLR2025β451Updated 2 months ago
- Official Repository of OmniCaptionerβ152Updated 2 months ago
- [ICCV2025] Official implementation of paper "Towards Performance Consistency in Multi-Level Model Collaboration"β41Updated 2 weeks ago
- β156Updated last month
- Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMsβ157Updated 4 months ago
- β208Updated last month
- β73Updated 3 months ago
- Code for paper "Towards Understanding Camera Motions in Any Video"β200Updated last month
- [NeurIPS 2024] Matryoshka Query Transformer for Large Vision-Language Modelsβ110Updated last year
- Wan2.1 with Controlnetβ172Updated 3 months ago
- [NeurIPS 2022] Official code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answeringβ102Updated 3 months ago
- [CVPR 2024] Official code for "Text-Driven Image Editing via Learnable Regions"β224Updated 9 months ago
- [ICML2025] Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignmentβ109Updated 3 weeks ago
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTSβ1,201Updated 3 months ago
- β62Updated 4 months ago
- (CVPR 2024) Towards Automatic Power Battery Detection: New Challenge, Benchmark Dataset and Baselineβ101Updated 3 months ago