A multi-task model that performs image captioning, sentence paraphrasing, and cross-modal retrieval.
☆19 Nov 21, 2019 Updated 6 years ago
Alternatives and similar repositories for STT
Users that are interested in STT are comparing it to the libraries listed below.
- ☆10 Apr 20, 2018 Updated 8 years ago
- ☆23 Aug 18, 2018 Updated 7 years ago
- Use GCN to classify Mnist ☆11 Mar 19, 2020 Updated 6 years ago
- Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019 ☆41 Sep 24, 2019 Updated 6 years ago
- A new plugin for Vue3 Composition API. It provides various attributes and customized content to meet most requirements of Composition. ☆17 Oct 10, 2022 Updated 3 years ago
- ☆15 Jul 24, 2017 Updated 8 years ago
- Cross-modal Coherence Modeling for Caption Generation ☆11 Jul 24, 2020 Updated 5 years ago
- Scraping Program for Pascal Sentence Dataset ☆17 Sep 9, 2015 Updated 10 years ago
- This repository contains the author's implementation in PyTorch for the paper "Adaptive Label-aware Graph Convolutional Networks for Cros… ☆15 Dec 6, 2021 Updated 4 years ago
- For ICDAR 2019 Paper on End-to-end License Plate and Scene Text Recognition with multi-head attention models ☆25 Aug 14, 2021 Updated 4 years ago
- Extract the key frame from the tested video, and then search the most similar Images from the database, which consists over 1,4000 pictur… ☆10 Mar 13, 2014 Updated 12 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f… ☆16 Apr 22, 2021 Updated 5 years ago
- mxnet, fast-insightface, face recognition, face detect ☆14 Jun 6, 2019 Updated 6 years ago
- My assignments for CN course [CSE232] [IIIT-Delhi]. ☆13 May 24, 2018 Updated 7 years ago
- A website which lets you download youtube videos and also maintain your profile and past downloads. Website developed using the Django Fr… ☆10 Oct 27, 2018 Updated 7 years ago
- Code for ComEx [CVPR 2022] ☆12 Dec 5, 2022 Updated 3 years ago
- web3js + infura + android + solidity ☆10 Feb 2, 2019 Updated 7 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018) ☆36 Sep 5, 2018 Updated 7 years ago
- A web application made using Python 3, Django 2, Bootstrap and REST API. It's website about technology where user can find interesting ne… ☆12 Dec 8, 2022 Updated 3 years ago
- Partially Non-Autoregressive Image Captioning ☆10 Sep 30, 2021 Updated 4 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019 ☆50 Dec 18, 2019 Updated 6 years ago
- nocaps: novel object captioning at scale ☆10 May 23, 2019 Updated 6 years ago
- Self-Supervised Domain Adaptation with Consistency Training ☆20 Oct 28, 2020 Updated 5 years ago
- an implementation of Deformation Graph compatible with CUDA C++ and used in warping defamations in real-time non-rigid registration ☆10 Jan 22, 2020 Updated 6 years ago
- Person Keypoint Detection in PyTorch ☆13 Mar 20, 2020 Updated 6 years ago
- NeurIPS22 "RankFeat: Rank-1 Feature Removal for Out-of-distribution Detection" and T-PAMI Extension ☆20 Feb 21, 2025 Updated last year
- ☆16 Jan 30, 2022 Updated 4 years ago
- Synthetic_Data_Engine_For_Text_Recognition ☆37 Aug 10, 2017 Updated 8 years ago
- Task-Adaptive Feature Sub-Space Learning for few-shot classification ☆12 Sep 26, 2020 Updated 5 years ago
- Matlab demo code for "MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient Cross-Modal Retrieval" ☆17 Sep 13, 2019 Updated 6 years ago
- Feature extraction by using SITF+BoF. ☆22 Jan 9, 2018 Updated 8 years ago
- [AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs". ☆38 Oct 4, 2023 Updated 2 years ago
- Code for Unsupervised Image Captioning ☆223 Mar 24, 2023 Updated 3 years ago
- (AAAI'20) The source code for the paper "Joint Parsing and Generation for Abstractive Summarization". ☆15 Apr 3, 2020 Updated 6 years ago
- ☆12 Jan 8, 2025 Updated last year
- PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018) ☆580 May 18, 2023 Updated 3 years ago
- Triangular mesh generation and manipulation ☆13 Jul 3, 2021 Updated 4 years ago
- Simple tool to change the INPUT and OUTPUT shape of ONNX. ☆15 Apr 1, 2025 Updated last year
- Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019) ☆135 Mar 15, 2024 Updated 2 years ago