[AAAI 2026 Oral] HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment
☆31Dec 17, 2025Updated 4 months ago
Alternatives and similar repositories for HiMo-CLIP
Users that are interested in HiMo-CLIP are comparing it to the libraries listed below.
- Optimizing for the Shortest Path in Denoising Diffusion Model (CVPR2025)☆22Dec 17, 2025Updated 4 months ago
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…☆20Mar 13, 2025Updated last year
- HexAgent – An Agent harness that gives any LLM a computer to complete tasks the way humans do☆112Updated this week
- a new☆23Mar 2, 2025Updated last year
- [NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation☆111Apr 16, 2026Updated last week
- PyTorch (1.6+) implementation of https://github.com/kang205/SASRec☆10Aug 28, 2024Updated last year
- MSCA: Multi-Scale Channel Attention Module☆16Nov 24, 2021Updated 4 years ago
- [AAAI 2025] Enhance Vision-Language Alignment with Noise☆25Dec 19, 2024Updated last year
- [MM2024] LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention☆13Dec 24, 2024Updated last year
- This is the official code for the CIKM 2024 paper "MARS: Matching Attribute-aware Representations for Text-based Sequential Recommendatio…☆18Apr 23, 2025Updated last year
- A variant of Varibad that is robust to difficult tasks☆11Aug 30, 2023Updated 2 years ago
- Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion☆16Mar 14, 2025Updated last year
- ☆14Apr 18, 2022Updated 4 years ago
- Mamba4Rec implemented without RecBole☆17Jun 3, 2024Updated last year
- ☆57Dec 30, 2024Updated last year
- ICCV 2023: The Euclidean Space is Evil: Hyperbolic Attribute Editing for Few-shot Image Generation☆15Sep 29, 2023Updated 2 years ago
- The official PyTorch implementation of the paper "Where is My Spot? Few-shot Image Generation via Latent Subspace Optimization", CVPR 2023.☆11Jan 6, 2024Updated 2 years ago
- ☆20Aug 27, 2025Updated 8 months ago
- ☆19May 22, 2021Updated 4 years ago
- Codes for Hierarchical Time-Aware Mixture of Experts for Multi-Modal Sequential Recommendation (WWW2025)☆30Jun 17, 2025Updated 10 months ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- ☆15Apr 5, 2023Updated 3 years ago
- Code release for Proto-CLIP: Vision-Language Prototypical Network for Few-Shot Learning | IROS 2024☆54Dec 30, 2024Updated last year
- UnicomAI Large Model Benchmark☆64Oct 27, 2025Updated 6 months ago
- (ICCV 2023) Generative Multiplane Neural Radiance for 3D Aware Image Generation.☆19Sep 28, 2023Updated 2 years ago
- Test queries against the State Taxation Administration's national VAT invoice verification platform (https://inv-veri.chinatax.gov.cn/)☆12Jan 3, 2023Updated 3 years ago
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆25Jun 17, 2025Updated 10 months ago
- MICCAI2022 GOALS Challenge & Paper accepted by TMI2023 (Retinal Layer Segmentation in OCT images with Boundary Regression and Feature Pol…☆18Oct 16, 2023Updated 2 years ago
- 📖Curated list on the reasoning ability of MLLMs, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.☆13Feb 7, 2025Updated last year
- [CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want☆14Jan 5, 2025Updated last year
- MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)☆13Apr 17, 2024Updated 2 years ago
- This is the GitHub repository for a fault-tolerant optimal ZNN controller with state constraints and prescribed performance constraints de…☆20Nov 11, 2024Updated last year
- IPO: Interpretable Prompt Optimization for Vision-Language Models (NeurIPS 2024)☆15Mar 4, 2025Updated last year
- This is a C++ implementation of cocoapi bbox evaluation code.☆11Dec 9, 2021Updated 4 years ago
- Code for AAAI 2024 paper "GCNext: Towards the Unity of Graph Convolutions for Human Motion Prediction"☆18Jan 16, 2025Updated last year
- ☆23Nov 27, 2025Updated 5 months ago
- An implementation of AutoScale regression-based method☆12Oct 27, 2020Updated 5 years ago
- [TNNLS 2023] Disentangled Feature Representation for Few-shot Image Classification☆28Feb 21, 2024Updated 2 years ago
- Code for paper 'Peri-midFormer: Periodic Pyramid Transformer for Time Series Analysis (NeurIPS 2024 Spotlight)'☆40Jan 8, 2025Updated last year