A multi-task model that performs image captioning, sentence paraphrasing, and cross-modal retrieval.
☆19Nov 21, 2019Updated 6 years ago
Alternatives and similar repositories for STT
Users that are interested in STT are comparing it to the libraries listed below.
- ☆10Apr 20, 2018Updated 8 years ago
- ☆23Aug 18, 2018Updated 7 years ago
- Evaluating cross-media retrieval using a new protocol.☆11Mar 14, 2017Updated 9 years ago
- Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019☆41Sep 24, 2019Updated 6 years ago
- In this work, we implement different cross-modal learning schemes such as Siamese Network, Correlational Network and Deep Cross-Modal Pro…☆11Aug 23, 2021Updated 4 years ago
- A new plugin for Vue3 Composition API. It provides various attributes and customized content to meet most requirements of Composition.☆17Oct 10, 2022Updated 3 years ago
- ☆15Jul 24, 2017Updated 8 years ago
- ☆30Oct 2, 2018Updated 7 years ago
- Cross-modal Coherence Modeling for Caption Generation☆11Jul 24, 2020Updated 5 years ago
- Scraping Program for Pascal Sentence Dataset☆17Sep 9, 2015Updated 10 years ago
- For ICDAR 2019 Paper on End-to-end License Plate and Scene Text Recognition with multi-head attention models☆25Aug 14, 2021Updated 4 years ago
- Source code for paper "Adversary Guided Asymmetric Hashing for Cross-Modal Retrieval".☆40Sep 4, 2019Updated 6 years ago
- I implemented a detection algorithm with a classification data set that does not have annotation information for the bounding box. Based …☆31Jan 29, 2018Updated 8 years ago
- Implementation of the paper "Joint Feature Selection and Subspace Learning for Cross-Modal Retrieval"☆17Dec 14, 2017Updated 8 years ago
- mxnet, fast-insightface, face recognition, face detection☆14Jun 6, 2019Updated 6 years ago
- My assignments for CN course [CSE232] [IIIT-Delhi].☆13May 24, 2018Updated 7 years ago
- A website which lets you download youtube videos and also maintain your profile and past downloads. Website developed using the Django Fr…☆10Oct 27, 2018Updated 7 years ago
- Code for ComEx [CVPR 2022]☆12Dec 5, 2022Updated 3 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018)☆36Sep 5, 2018Updated 7 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019☆50Dec 18, 2019Updated 6 years ago
- A web application made using Python 3, Django 2, Bootstrap and REST API. It's website about technology where user can find interesting ne…☆12Dec 8, 2022Updated 3 years ago
- Partially Non-Autoregressive Image Captioning☆10Sep 30, 2021Updated 4 years ago
- nocaps: novel object captioning at scale☆10May 23, 2019Updated 6 years ago
- An implementation of Deformation Graph compatible with CUDA C++ and used in warping deformations in real-time non-rigid registration☆10Jan 22, 2020Updated 6 years ago
- Person Keypoint Detection in PyTorch☆13Mar 20, 2020Updated 6 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- Synthetic_Data_Engine_For_Text_Recognition☆37Aug 10, 2017Updated 8 years ago
- Tensorflow implementation of paper: A Hierarchical Approach for Generating Descriptive Image Paragraphs☆15Apr 27, 2018Updated 8 years ago
- Matlab demo code for "MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient Cross-Modal Retrieval"☆17Sep 13, 2019Updated 6 years ago
- Feature extraction using SIFT+BoF.☆22Jan 9, 2018Updated 8 years ago
- [AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".☆38Oct 4, 2023Updated 2 years ago
- Code for Unsupervised Image Captioning☆223Mar 24, 2023Updated 3 years ago
- (AAAI'20) The source code for the paper "Joint Parsing and Generation for Abstractive Summarization".☆15Apr 3, 2020Updated 6 years ago
- Triangular mesh generation and manipulation☆13Jul 3, 2021Updated 4 years ago
- Simple tool to change the INPUT and OUTPUT shape of ONNX.☆15Apr 1, 2025Updated last year
- Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)☆135Mar 15, 2024Updated 2 years ago
- SOIT: Segmenting Objects with Instance-Aware Transformers☆14Jun 6, 2022Updated 3 years ago
- ☆15Nov 26, 2023Updated 2 years ago
- Unofficial implementation of the paper I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models.☆19Mar 13, 2024Updated 2 years ago