mshukor/ViCHA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mshukor/ViCHA)

mshukor / ViCHA

[BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment"

☆54

Alternatives and similar repositories for ViCHA

Users that are interested in ViCHA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Paranioar / RCAR
View on GitHub
[TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”
☆34Apr 11, 2024Updated 2 years ago
LgQu / TIGeR
View on GitHub
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16Updated this week
LgQu / CAMERA
View on GitHub
Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20
☆29May 26, 2022Updated 4 years ago
jiquan123 / TIER
View on GitHub
TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment
☆10Mar 1, 2025Updated last year
mshukor / eP-ALM
View on GitHub
[ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.
☆27Oct 27, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yonatanbitton / wysiwyr
View on GitHub
☆37Oct 7, 2023Updated 2 years ago
hila-chefer / Conceptor
View on GitHub
Official implementation of the paper The Hidden Language of Diffusion Models
☆78Jan 24, 2024Updated 2 years ago
aurooj / WSG-VQA-VLTransformers
View on GitHub
Weakly Supervised Grounding for VQA in Vision-Language Transformers
☆17May 6, 2023Updated 3 years ago
allenai / reclip
View on GitHub
☆92Apr 15, 2022Updated 4 years ago
emited / gantk2
View on GitHub
GAN(TK)²: GAN Neural Tangent Kernel ToolKit
☆13Jul 12, 2022Updated 4 years ago
codezakh / LilT
View on GitHub
[ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning
☆40Jul 29, 2023Updated 2 years ago
airsplay / vimpac
View on GitHub
☆73Jun 3, 2022Updated 4 years ago
CrossmodalGroup / ER-SAN
View on GitHub
Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.
☆25Aug 5, 2023Updated 2 years ago
lixinustc / GraphAdapter
View on GitHub
The efficient tuning method for VLMs
☆83Mar 10, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Guillem96 / data2vec-vision
View on GitHub
PyTorch implementation of Data2Vec self-supervised approach for vision use cases.
☆18Oct 7, 2022Updated 3 years ago
gabfstr / DiffusionTrack
View on GitHub
Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking
☆13Apr 12, 2023Updated 3 years ago
Roc-Ng / HANet
View on GitHub
PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).
☆47Aug 19, 2021Updated 4 years ago
Hxyou / MSCLIP
View on GitHub
Official Code of ECCV 2022 paper MS-CLIP
☆91Jul 27, 2022Updated 3 years ago
penghu-cs / RCL
View on GitHub
Cross-Modal Retrieval with Partially Mismatched Pairs (IEEE TPAMI 2023, PyTorch Code)
☆23Sep 17, 2023Updated 2 years ago
YYJMJC / LOUPE
View on GitHub
☆45Aug 14, 2023Updated 2 years ago
mlfoundations / imagenet-captions
View on GitHub
Release of ImageNet-Captions
☆51Jan 20, 2023Updated 3 years ago
amazon-science / prompt-pretraining
View on GitHub
Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"
☆259May 3, 2024Updated 2 years ago
cloneofsimo / ptar
View on GitHub
☆13Jun 3, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
josiahwang / phraseloceval
View on GitHub
Phrase Localization Evaluation Toolkit
☆20Aug 16, 2019Updated 6 years ago
ylsung / VL_adapter
View on GitHub
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)
☆212Dec 18, 2022Updated 3 years ago
mzhaoshuai / CenterCLIP
View on GitHub
[SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval.
☆134May 4, 2022Updated 4 years ago
Tangshitao / CENet
View on GitHub
Channel Equilibrium Networks for Learning Deep Representation, ICML2020
☆22Jul 28, 2020Updated 5 years ago
lxa9867 / QSD
View on GitHub
[CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"
☆12Feb 27, 2024Updated 2 years ago
zhuchen03 / gradinit
View on GitHub
Learning to Initialize Neural Networks for Stable and Efficient Training
☆138May 24, 2022Updated 4 years ago
foolwood / DRL
View on GitHub
[arXiv22] Disentangled Representation Learning for Text-Video Retrieval
☆96Apr 7, 2022Updated 4 years ago
CuthbertCai / Ask-Confirm
View on GitHub
Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)
☆20Dec 4, 2021Updated 4 years ago
baaaad / ECE
View on GitHub
[ECCV'22 Poster] Explicit Image Caption Editing
☆22Nov 30, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
QiuHeqian / mmdetection-ref
View on GitHub
☆10Jan 9, 2025Updated last year
katie-gu / Image-Similarity-Search
View on GitHub
A representation learning command line application in TensorFlow that searches for images that have the most features in common with the …
☆10Aug 18, 2018Updated 7 years ago
minghangz / cnm
View on GitHub
Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining
☆31Apr 4, 2022Updated 4 years ago
callsys / TextVR
View on GitHub
[PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension
☆32Dec 28, 2023Updated 2 years ago
zhangy0822 / USER
View on GitHub
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024
☆33Jun 18, 2025Updated last year
jayleicn / singularity
View on GitHub
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
☆136May 5, 2023Updated 3 years ago
kavishgambhir / xy-cut-tree
View on GitHub
Segmenting a given document using recursive xy-cut algorithm.
☆12Oct 9, 2018Updated 7 years ago