[AAAI 2026 Oral] HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment
☆29Dec 17, 2025Updated 6 months ago
Alternatives and similar repositories for HiMo-CLIP
Users that are interested in HiMo-CLIP are comparing it to the libraries listed below.
- Optimizing for the Shortest Path in Denoising Diffusion Model (CVPR2025)☆20Dec 17, 2025Updated 6 months ago
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…☆21Mar 13, 2025Updated last year
- UniHarness (formerly HexAgent) – An agent harness that gives any LLM a computer to complete tasks the way humans do☆124Jun 4, 2026Updated 3 weeks ago
- a new☆22Mar 2, 2025Updated last year
- [NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation☆121Updated this week
- MSCA: Multi-Scale Channel Attention Module☆16Nov 24, 2021Updated 4 years ago
- [AAAI 2025] Enhance Vision-Language Alignment with Noise☆26Dec 19, 2024Updated last year
- [MM2024] LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention☆13Dec 24, 2024Updated last year
- This is the official code for the CIKM 2024 paper "MARS: Matching Attribute-aware Representations for Text-based Sequential Recommendatio…☆17Apr 23, 2025Updated last year
- A variant of Varibad that is robust to difficult tasks☆11Aug 30, 2023Updated 2 years ago
- ☆14Apr 18, 2022Updated 4 years ago
- Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion☆16Mar 14, 2025Updated last year
- Mamba4Rec implemented without RecBole☆17Jun 3, 2024Updated 2 years ago
- ☆55Dec 30, 2024Updated last year
- ICCV 2023: The Euclidean Space is Evil: Hyperbolic Attribute Editing for Few-shot Image Generation☆15Sep 29, 2023Updated 2 years ago
- The official PyTorch implementation of the paper "Where is My Spot? Few-shot Image Generation via Latent Subspace Optimization", CVPR 2023.☆11Jan 6, 2024Updated 2 years ago
- Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation (ICML 2024)☆12Aug 9, 2024Updated last year
- ☆21Aug 27, 2025Updated 10 months ago
- ☆19May 22, 2021Updated 5 years ago
- Codes for Hierarchical Time-Aware Mixture of Experts for Multi-Modal Sequential Recommendation (WWW2025)☆32Jun 17, 2025Updated last year
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- ☆15Apr 5, 2023Updated 3 years ago
- Code release for Proto-CLIP: Vision-Language Prototypical Network for Few-Shot Learning | IROS 2024☆54Dec 30, 2024Updated last year
- (ICCV 2023) Generative Multiplane Neural Radiance for 3D Aware Image Generation.☆18Sep 28, 2023Updated 2 years ago
- Test queries against the State Taxation Administration's national VAT invoice verification platform (https://inv-veri.chinatax.gov.cn/)☆12Jan 3, 2023Updated 3 years ago
- UnicomAI Large Model Benchmark☆68May 20, 2026Updated last month
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆26Jun 17, 2025Updated last year
- MICCAI2022 GOALS Challenge & Paper accepted by TMI2023 (Retinal Layer Segmentation in OCT images with Boundary Regression and Feature Pol…☆18Oct 16, 2023Updated 2 years ago
- 📖Curated list about reasoning ability of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.☆13Feb 7, 2025Updated last year
- [CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want☆14Jan 5, 2025Updated last year
- MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)☆13Apr 17, 2024Updated 2 years ago
- This is the GitHub repository for a fault-tolerant optimal ZNN controller with state constraints and prescribed performance constraints de…☆20Nov 11, 2024Updated last year
- IPO: Interpretable Prompt Optimization for Vision-Language Models(NeurIPS 2024)☆15Jun 12, 2026Updated 2 weeks ago
- This is a C++ implementation of cocoapi bbox evaluation code.☆11Dec 9, 2021Updated 4 years ago
- Code for AAAI 2024 paper "GCNext: Towards the Unity of Graph Convolutions for Human Motion Prediction"☆18Jan 16, 2025Updated last year
- ☆25Nov 27, 2025Updated 7 months ago
- An implementation of AutoScale regression-based method☆12Oct 27, 2020Updated 5 years ago
- [TNNLS 2023] Disentangled Feature Representation for Few-shot Image Classification☆28Feb 21, 2024Updated 2 years ago
- Code for paper 'Peri-midFormer: Periodic Pyramid Transformer for Time Series Analysis (NeurIPS 2024 Spotlight)'☆40Jan 8, 2025Updated last year