UnicomAI/HiMo-CLIP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UnicomAI/HiMo-CLIP)

UnicomAI / HiMo-CLIP

[AAAI 2026 Oral] HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment

☆29

Alternatives and similar repositories for HiMo-CLIP

Users that are interested in HiMo-CLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

joelulu / Awesome-Acceleration-GenAI
View on GitHub
Collection of Acceleration Methods for Generative AI
☆29Dec 9, 2025Updated 7 months ago
mk-minchul / sapiensid
View on GitHub
☆26Nov 17, 2025Updated 8 months ago
LINs-lab / RCGM
View on GitHub
[ICLR 2026] Any-step Generation via N-th Order Recursive Consistent Velocity Field Estimation
☆40Feb 4, 2026Updated 5 months ago
IVY-LVLM / Counterfactual-Inception
View on GitHub
Official PyTorch Implementation for the "What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-mod…
☆20Sep 26, 2024Updated last year
UnicomAI / LeMiCa
View on GitHub
[NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation
☆122Jun 22, 2026Updated 3 weeks ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
eslambakr / EMCA
View on GitHub
MSCA: Multi-Scale Channel Attention Module
☆16Nov 24, 2021Updated 4 years ago
Rajpal9 / ZNN_Optimal_Control
View on GitHub
This is the Github repository for a fault tolerant optimal ZNN controller with state constraints and precribed performance constraints de…
☆21Nov 11, 2024Updated last year
hyzhang98 / PiNI
View on GitHub
[AAAI 2025] Enhance Vision-Language Alignment with Noise
☆26Dec 19, 2024Updated last year
chancharikmitra / SAVs
View on GitHub
Official Codebase for "Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers"
☆26Jun 7, 2025Updated last year
gqq1210 / AS-UNet
View on GitHub
☆11Apr 4, 2021Updated 5 years ago
duzw9311 / LDA-AQU
View on GitHub
[MM2024] LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention
☆13Dec 24, 2024Updated last year
GasolSun36 / GRACE
View on GitHub
[ICLR 2025] Official repo for paper: "GRACE: Generative Representation Learning via Contrastive Policy Optimization"
☆39Feb 3, 2026Updated 5 months ago
lingxiao-li / HAE
View on GitHub
ICCV 2023: The Euclidean Space is Evil: Hyperbolic Attribute Editing for Few-shot Image Generation
☆15Sep 29, 2023Updated 2 years ago
ant-research / M2-Miner
View on GitHub
[ICLR 2026] M2-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining
☆55Apr 22, 2026Updated 2 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
HHHLF / SECA_AAAI2026
View on GitHub
Code for "Harnessing Textual Semantic Priors for Knowledge Transfer and Refinement in CLIP-Driven Continual Learning" (AAAI-2026 poster)
☆16Mar 13, 2026Updated 4 months ago
zihou98 / Whole-Slide-Image
View on GitHub
Working note for WSI analysis
☆10Apr 3, 2023Updated 3 years ago
apple / ml-mobileclip-dr
View on GitHub
RayGen: Multi-Modal Dataset Reinforcement for MobileCLIP and MobileCLIP2
☆40Mar 12, 2026Updated 4 months ago
yrluestc / NeurIPS2025-LEAR
View on GitHub
The official implementation for "Learning Expandable and Adaptable Representations for Continual Learning" (NeurIPS2025)
☆15Jan 18, 2026Updated 6 months ago
chansey0529 / LSO
View on GitHub
The official Pytorch implementation of paper Where is My Spot? Few-shot Image Generation via Latent Subspace Optimization, CVPR 2023.
☆11Jan 6, 2024Updated 2 years ago
BingSu12 / Log-Polar-Space-Convolution
View on GitHub
Log-Polar Space Convolution for Convolutional Neural Networks
☆13Dec 12, 2022Updated 3 years ago
MP-ReID / mp-reid
View on GitHub
Multi-modal Multi-platform Person Re-Identification: Benchmark and Method
☆25Aug 21, 2025Updated 11 months ago
LCO-Embedding / LCO-Embedding
View on GitHub
[NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learning
☆47Apr 13, 2026Updated 3 months ago
bowang-lab / CellSeg-Transformers
View on GitHub
☆15May 7, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
force-sight / forcesight
View on GitHub
Given an RGBD image and a text prompt, ForceSight produces visual-force goals for a robot, enabling mobile manipulation in unseen environ…
☆25Nov 6, 2023Updated 2 years ago
ZhilingYan / Hetero-UNet
View on GitHub
☆14Oct 22, 2024Updated last year
amandpkr / GMNR
View on GitHub
(ICCV 2023) Generative Multiplane Neural Radiance for 3D Aware Image Generation.
☆18Sep 28, 2023Updated 2 years ago
webai-defi / webai-defi-core
View on GitHub
AI agents weaving intelligence, execution, and automation into DeFi
☆10Feb 28, 2025Updated last year
kaku289 / Robotics-Estimation-and-Learning
View on GitHub
Coursera MOOC on Robotics: Estimation and Learning
☆16Feb 17, 2017Updated 9 years ago
NagisaZj / MetaCURE-Public
View on GitHub
☆15Apr 5, 2023Updated 3 years ago
GaryGuTC / UniME-v2
View on GitHub
[AAAI 2026 Oral] The official code of "UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning"
☆74Dec 8, 2025Updated 7 months ago
taozh2017 / Text-SemiSeg
View on GitHub
☆21Aug 27, 2025Updated 10 months ago
iota9star / fapiao-simple
View on GitHub
国家税务总局全国增值税发票查验平台（https://inv-veri.chinatax.gov.cn/）测试查询
☆12Jan 3, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
TerryPei / CSP
View on GitHub
Cross-Self KV Cache Pruning for Efficient Vision-Language Inference
☆10Dec 15, 2024Updated last year
niuzaisheng / ScreenExplorer
View on GitHub
ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World
☆26Jun 17, 2025Updated last year
JiahengZhao / FS-SLAM
View on GitHub
Implementation of the accepted paper "2D Laser SLAM with Closed Shape Features: Fourier Series Parameterization and Submap Joining"
☆16Jul 18, 2021Updated 5 years ago
mingukkang / FlashDecoder
View on GitHub
Official FlashDecoder Github
☆15Apr 4, 2026Updated 3 months ago
ShengjieSun419 / CARD
View on GitHub
Code for paper "A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning"
☆16Jul 15, 2025Updated last year
Roboy / Roboy
View on GitHub
modules required for running Roboy at fairs
☆12Sep 24, 2018Updated 7 years ago
tyb311 / TCCT
View on GitHub
MICCAI2022 GOALS Challenge & Paper accepted by TMI2023 (Retinal Layer Segmentation in OCT images with Boundary Regression and Feature Pol…
☆18Oct 16, 2023Updated 2 years ago