TAU-VAILab/hierarcaps

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TAU-VAILab/hierarcaps)

TAU-VAILab / hierarcaps

Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)

☆34

Alternatives and similar repositories for hierarcaps

Users that are interested in hierarcaps are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

naver-ai / prolip
View on GitHub
☆58Aug 16, 2025Updated 11 months ago
saibr / hypvl
View on GitHub
This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…
☆21Jul 5, 2024Updated 2 years ago
facebookresearch / meru
View on GitHub
Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023
☆204Aug 23, 2023Updated 2 years ago
haoyu-bu / CAFe
View on GitHub
Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"
☆33Mar 26, 2025Updated last year
aimagelab / HySAC
View on GitHub
Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025
☆31Apr 8, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kwonjunn01 / Hi-Mapper
View on GitHub
☆19Nov 29, 2024Updated last year
ytaek-oh / fsc-clip
View on GitHub
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
☆23Oct 8, 2024Updated last year
wangzy01 / ACTIVE-Action-from-Robotic-View
View on GitHub
ICCV 2025 Recognizing Actions from Robotic View for Natural Human-Robot Interaction
☆17Feb 5, 2026Updated 5 months ago
wusize / CLIM
View on GitHub
[AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation
☆30Feb 4, 2024Updated 2 years ago
hyunji12 / Open3DRF
View on GitHub
☆21Aug 20, 2024Updated last year
adobe-research / llava-score
View on GitHub
☆11Oct 2, 2024Updated last year
dhg-wei / TOPA
View on GitHub
(NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
☆29Sep 27, 2024Updated last year
SalesforceAIResearch / LATTE
View on GitHub
☆70Jun 2, 2026Updated last month
naver-ai / pcmepp
View on GitHub
Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)
☆64May 26, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mshukor / eP-ALM
View on GitHub
[ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.
☆27Oct 27, 2023Updated 2 years ago
wuw2019 / LoTLIP
View on GitHub
[NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
☆49Jan 14, 2025Updated last year
snap-research / MyVLM
View on GitHub
Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)
☆188Jul 5, 2024Updated 2 years ago
haon-chen / mmE5
View on GitHub
☆59Feb 27, 2025Updated last year
mlvlab / OVQA
View on GitHub
Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…
☆18Apr 23, 2024Updated 2 years ago
DIAL-RPI / Fed-MENU
View on GitHub
A python (PyTorch) implementation of federated multi-encoding U-Net (Fed-MENU) method for federated learning-based multi-organ segmentati…
☆16Nov 5, 2024Updated last year
ChenyuHeidiZhang / VL-commonsense
View on GitHub
☆14May 23, 2022Updated 4 years ago
rabiulcste / vismin
View on GitHub
[NeurIPS24] VisMin: Visual Minimal-Change Understanding
☆19Mar 3, 2025Updated last year
naver-ai / lut
View on GitHub
[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"
☆14Dec 1, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
TerminologyHub / termhub-in-5-minutes
View on GitHub
Developer project for getting basic API integrations working in under 5 minutes
☆11May 22, 2026Updated 2 months ago
LisaAnne / TemporalLanguageRelease
View on GitHub
☆44Mar 8, 2021Updated 5 years ago
TwoBranchDracaena / OpenFace-PyTorch
View on GitHub
PyTorch model of OpenFace
☆12May 8, 2017Updated 9 years ago
antoyang / FrozenBiLM
View on GitHub
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
☆159Dec 9, 2024Updated last year
teaching-clip-to-count / teaching-clip-to-count.github.io
View on GitHub
☆15Feb 24, 2023Updated 3 years ago
tripletclip / TripletCLIP
View on GitHub
[NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"
☆48Dec 1, 2024Updated last year
EnricoCancelli / ProximitySocialNav
View on GitHub
repository for "Exploiting Proximity-Aware Tasks for Embodied Social Navigation" paper code
☆12Nov 16, 2023Updated 2 years ago
mala-lab / SIC-CADS
View on GitHub
Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)
☆30Jan 12, 2024Updated 2 years ago
snumprlab / isr-dpo
View on GitHub
Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)
☆23Nov 25, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Hleephilip / CSG
View on GitHub
Official implementation of "Conditional Score Guidance for Text-Driven Image-to-Image Translation" (NeurIPS 2023)
☆11Jul 19, 2023Updated 3 years ago
huang-yh / Owl
View on GitHub
☆52Dec 13, 2024Updated last year
WentingXu3o3 / TB-HSU
View on GitHub
[AAAI 2025] Official data and code for "TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances"
☆15Sep 11, 2025Updated 10 months ago
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
View on GitHub
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆18Sep 17, 2021Updated 4 years ago
mvrl / geoclap
View on GitHub
☆13May 24, 2026Updated 2 months ago
PKU-YuanGroup / LLMBind
View on GitHub
LLMBind: A Unified Modality-Task Integration Framework
☆19Jun 16, 2024Updated 2 years ago
jaeseokbyun / GRIT-VLP
View on GitHub
This is an official implementation of GRIT-VLP
☆20Aug 8, 2022Updated 3 years ago