A multi-task model that performs image captioning, sentence paraphrasing, and cross-modal retrieval.
☆19 Nov 21, 2019 Updated 6 years ago
Alternatives and similar repositories for STT
Users that are interested in STT are comparing it to the libraries listed below
- ☆10 Apr 20, 2018 Updated 7 years ago
- For ICDAR 2019 Paper on End-to-end License Plate and Scene Text Recognition with multi-head attention models ☆25 Aug 14, 2021 Updated 4 years ago
- I implemented a detection algorithm with a classification data set that does not have annotation information for the bounding box. Based … ☆31 Jan 29, 2018 Updated 8 years ago
- Codes for MICCAI 2021 Paper: Selective Learning from External Data for CT Image Segmentation ☆12 Oct 10, 2021 Updated 4 years ago
- some materials about deep learning on medical image like x-rays, MRI, CT ☆32 Aug 13, 2017 Updated 8 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018) ☆36 Sep 5, 2018 Updated 7 years ago
- Partially Non-Autoregressive Image Captioning ☆10 Sep 30, 2021 Updated 4 years ago
- Factor Modeling for radiomics ☆12 Aug 29, 2025 Updated 6 months ago
- ☆12 Jan 8, 2025 Updated last year
- Cheatsheet for slurm command lines ☆10 Apr 9, 2023 Updated 2 years ago
- [AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs". ☆38 Oct 4, 2023 Updated 2 years ago
- ☆12 Sep 11, 2021 Updated 4 years ago
- Submission for MICCAI HACKATHON: https://miccai-hackathon.com/#participate ☆15 Jul 19, 2023 Updated 2 years ago
- code for "Multi-modality contrastive learning for sarcopenia screening from hip X-rays and clinical information" in MICCAI 2023 ☆16 Dec 3, 2025 Updated 2 months ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f… ☆16 Apr 22, 2021 Updated 4 years ago
- Source code for paper "Adversary Guided Asymmetric Hashing for Cross-Modal Retrieval". ☆39 Sep 4, 2019 Updated 6 years ago
- Synthetic_Data_Engine_For_Text_Recognition ☆37 Aug 10, 2017 Updated 8 years ago
- Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019 ☆41 Sep 24, 2019 Updated 6 years ago
- A web application made using Python 3, Django 2, Bootstrap and REST API. It's website about technology where user can find interesting ne… ☆12 Dec 8, 2022 Updated 3 years ago
- Measure the diversity of image descriptions, repository for our COLING 2018 paper. ☆13 Dec 29, 2019 Updated 6 years ago
- [CVPR 2022] Sequential Voting with Relational Box Fields for Active Object Detection ☆10 Jun 19, 2022 Updated 3 years ago
- Official implementation of "Weakly-supervised positional contrastive learning: application to cirrhosis classification", MICCAI 2023 ☆11 Dec 16, 2025 Updated 2 months ago
- FusionGAN: A generative adversarial network for infrared and visible image fusion ☆10 May 27, 2020 Updated 5 years ago
- Anatomy-guided domain adaptation for point cloud-based 3D in-bed human pose estimation ☆10 Dec 7, 2022 Updated 3 years ago
- Semi-Supervised Unpaired Multi-Modal Learning for Label-Efficient Medical Image Segmentation ☆10 Jun 29, 2021 Updated 4 years ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions? ☆12 Apr 11, 2025 Updated 10 months ago
- LiverCancerAssistant ☆10 Mar 24, 2020 Updated 5 years ago
- A website which lets you download youtube videos and also maintain your profile and past downloads. Website developed using the Django Fr… ☆10 Oct 27, 2018 Updated 7 years ago
- Coco datasets Visualization. ☆10 Aug 9, 2021 Updated 4 years ago
- Extract the key frame from the tested video, and then search the most similar Images from the database, which consists of over 1,4000 pictur… ☆10 Mar 13, 2014 Updated 11 years ago
- Detecting people who smoke in restricted places (Non-Smoking Areas) and informing the corresponding officials. ☆11 Aug 27, 2019 Updated 6 years ago
- Federated Conformal Prediction with Quantile-of-Quantiles (FedCP-QQ) ☆11 Aug 16, 2023 Updated 2 years ago
- ☆11 Nov 5, 2019 Updated 6 years ago
- ☆12 May 7, 2018 Updated 7 years ago
- code for paper `MemCap: Memorizing Style Knowledge for Image Captioning` ☆11 Mar 17, 2020 Updated 5 years ago
- ☆14 Mar 11, 2025 Updated 11 months ago
- An implementation for Generator Versus Segmentor: Pseudo-healthy Synthesis ☆12 Oct 22, 2021 Updated 4 years ago
- ☆12 May 19, 2025 Updated 9 months ago
- Project page for "Morphology-Aware Interactive Keypoint Estimation" accepted in MICCAI 2022. ☆13 Sep 14, 2024 Updated last year