jingliao132/CrossModalRetrieval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jingliao132/CrossModalRetrieval)

jingliao132 / CrossModalRetrieval

Pytorch implementation of 'See, Hear, and Read: Deep Aligned Representations'

☆33

Alternatives and similar repositories for CrossModalRetrieval

Users that are interested in CrossModalRetrieval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Kajiyu / LLLNet
View on GitHub
Keras Implementation of "Look, Listen and Learn" Model
☆21Nov 14, 2017Updated 8 years ago
uzeful / VA_Project
View on GitHub
Cross-modality (visual-auditory) Metric Learning Project
☆15Dec 19, 2017Updated 8 years ago
surisdi / youtube-8m
View on GitHub
Starter code for working with the YouTube-8M dataset.
☆16Jun 9, 2017Updated 9 years ago
penghu-cs / MAN
View on GitHub
Multimodal Adversarial Network for Cross-modal Retrieval (PyTorch Code)
☆29Apr 7, 2020Updated 6 years ago
caoyue10 / aaai17-cdq
View on GitHub
The implementation of AAAI-17 paper "Collective Deep Quantization of Efficient Cross-modal Retrieval"
☆34Mar 15, 2017Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
penghu-cs / SDML
View on GitHub
Scalable deep multimodal learning for cross-modal retrieval (SIGIR 2019, PyTorch Code)
☆35Jul 24, 2020Updated 6 years ago
yalesong / pvse
View on GitHub
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
☆135Mar 15, 2024Updated 2 years ago
YingZhangDUT / Cross-Modal-Projection-Learning
View on GitHub
TensorFlow Implementation of Deep Cross-Modal Projection Learning
☆95Nov 7, 2019Updated 6 years ago
PKU-ICST-MIPL / CM-GANS_TOMM2019
View on GitHub
Source code of our TOMM 2019 paper "CM-GANs: Cross-modal Generative Adversarial Networks for Common Representation Learning".
☆19Apr 18, 2019Updated 7 years ago
Fly2flies / Cross-modal-retrieval
View on GitHub
媒体计算实践作业：图像——文本跨模态搜索
☆40Dec 4, 2020Updated 5 years ago
HaohanWang / SelectAdditiveLearning
View on GitHub
implementation for the paper "Select-Additive Learning: Improving Cross-individual Generalization in Multimodal Sentiment Analysis"
☆23Nov 8, 2017Updated 8 years ago
Lishunkai / DenseDepthMapCreationFromSparsePoints
View on GitHub
☆10May 10, 2018Updated 8 years ago
congjianluo / crossModalRetrieval
View on GitHub
A demo of a cross-modal retrieval system
☆26Apr 29, 2020Updated 6 years ago
tranleanh / mosaic-data-augmentation
View on GitHub
Mosaic Data Augmentation in YOLOv4
☆18Jun 8, 2020Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
lelan-li / SSAH
View on GitHub
Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval(CVPR2018)
☆164Jul 20, 2018Updated 8 years ago
labyrinth7x / Deep-Cross-Modal-Projection-Learning-for-Image-Text-Matching
View on GitHub
Deep Cross-Modal Projection Learning for Image-Text Matching
☆77Sep 2, 2020Updated 5 years ago
PKU-ICST-MIPL / UGACH_AAAI2018
View on GitHub
Source code of our AAAI 2018 paper "Unsupervised Generative Adversarial Cross-modal Hashing"
☆53Oct 3, 2019Updated 6 years ago
rohitrango / objects-that-sound
View on GitHub
Unofficial Implementation of Google Deepmind's paper `Objects that Sound`
☆83May 7, 2018Updated 8 years ago
mwager / jAM
View on GitHub
automatic music transcription application written in java
☆12Jan 13, 2013Updated 13 years ago
yiling2018 / saem
View on GitHub
Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019
☆41Sep 24, 2019Updated 6 years ago
zhengyang5 / MMED400
View on GitHub
☆13Nov 19, 2024Updated last year
VisionLearningGroup / MULE
View on GitHub
Implementation of "MULE: Multimodal Universal Language Embedding"
☆16Dec 23, 2019Updated 6 years ago
DeepWiSe888 / Octopus
View on GitHub
☆16Aug 19, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
liuruoyu / cross-meida-evaluation
View on GitHub
Evaluation cross-media retrieval using a new protocol.
☆11Mar 14, 2017Updated 9 years ago
EsamGhaleb / soundNet_pytorch
View on GitHub
☆12Jul 18, 2018Updated 8 years ago
PKU-ICST-MIPL / FGCrossNet_ACMMM2019
View on GitHub
Source code of our ACM MM 2019 paper "A New Benchmark and Approach for Fine-grained Cross-media Retrieval".
☆58Dec 20, 2023Updated 2 years ago
videohatespeech / Implicit_Video_Hate
View on GitHub
☆17Aug 4, 2025Updated 11 months ago
FangxiangFeng / deepnet
View on GitHub
Implementation of some deep learning algorithms.
☆16Aug 27, 2014Updated 11 years ago
Wangt-CN / VQG-GCN
View on GitHub
A GCN based visual question generation model
☆13Aug 21, 2019Updated 6 years ago
CMU-INF-DIVA / avi-r
View on GitHub
AVI-R Package (formerly DIVA IO): A robust reader for AVI video files
☆13Dec 21, 2020Updated 5 years ago
zhongzhh8 / Cross-Modal-Retrieval
View on GitHub
Cross-Modal Retrieval, triplet loss, Pytorch, Resnet18, Bert, Deep Hashing
☆104Sep 15, 2019Updated 6 years ago
yupingso / numbered-musical-notation
View on GitHub
☆11Apr 24, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
peri044 / STT
View on GitHub
A multi-task model which does image captioning, sentence paraphrasing and cross-modal retrieval.
☆19Nov 21, 2019Updated 6 years ago
PKU-ICST-MIPL / MGAH_TMM2019
View on GitHub
Source code of our TMM 2019 paper "Multi-pathway Generative Adversarial Hashing for Unsupervised Cross-modal Retrieval"
☆12Jun 17, 2019Updated 7 years ago
pmiller10 / DNN
View on GitHub
Deep Neural Networks for Python
☆10Sep 22, 2015Updated 10 years ago
ttanprasert / sheet-midi-sync
View on GitHub
Code and data repository for ISMIR 2019 paper: MIDI–SHEET MUSIC ALIGNMENT USING BOOTLEG SCORE SYNTHESIS
☆12Mar 1, 2022Updated 4 years ago
niluthpol / multimodal_vtt
View on GitHub
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
☆68Apr 10, 2020Updated 6 years ago
devraj89 / Generalized-Semantic-Preserving-Hashing-for-N-Label-Cross-Modal-Retrieval
View on GitHub
This is the implementation for the paper "Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval"
☆14Dec 7, 2017Updated 8 years ago
LivXue / ALGCN
View on GitHub
This repository contains the author's implementation in PyTorch for the paper "Adaptive Label-aware Graph Convolutional Networks for Cros…
☆15Dec 6, 2021Updated 4 years ago