Image captioning models "show and tell" + "show, attend and tell" in PyTorch
☆19Jul 19, 2018Updated 7 years ago
Alternatives and similar repositories for image-captioning
Users that are interested in image-captioning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository reimplements "Show, Attend and Tell" model and add extra deep learning techniques.☆12Oct 3, 2023Updated 2 years ago
- An attention based sequential deep learning model implemented in pytorch to generate single line caption given an input image☆11Dec 29, 2020Updated 5 years ago
- Developing adversarial examples and showing their semantic generalization for the OpenAI CLIP model (https://github.com/openai/CLIP)☆26Mar 6, 2021Updated 5 years ago
- ☆19Mar 19, 2019Updated 7 years ago
- ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation. AAAI, 2025☆14Aug 25, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Covid-19 weibo rumor dataset, collected from 2020.1.22 to 2021.4.22☆13Jun 27, 2021Updated 4 years ago
- ACL Paper Lists(machine translation)☆13Mar 23, 2022Updated 4 years ago
- [WSDM 2019] Homogeneity-Based Transmissive Process To Model True and False News in Social Networks☆13Jun 8, 2021Updated 4 years ago
- ☆17Oct 22, 2020Updated 5 years ago
- Multi-Label Classification and Class Activation Map on Fashion MNIST☆11Mar 5, 2019Updated 7 years ago
- This is the PyTorch implementation of paper: FSR (AAAI 2023 Oral).☆12Sep 12, 2023Updated 2 years ago
- The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch.☆17Nov 11, 2021Updated 4 years ago
- ☆10Feb 27, 2020Updated 6 years ago
- Tensorflow 2.0 implementation of BSP-NET.☆11Dec 2, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆19Jul 29, 2025Updated 9 months ago
- SW components and demos for visual kinship recognition. An emphasis is put on the FIW dataset-- data loaders, benchmarks, results in summ…☆17Mar 13, 2023Updated 3 years ago
- [NeurIPS23] PromptRestorer: A Prompting Image Restoration Method with Degradation Perception☆15Aug 4, 2024Updated last year
- Google AI 2018 BERT pytorch implementation☆13Oct 22, 2018Updated 7 years ago
- Beyond Degradation Redundancy: Contrastive Prompt Learning for All-in-One Image Restoration☆28Feb 23, 2026Updated 2 months ago
- ☆11Feb 9, 2023Updated 3 years ago
- Run CLIP inference on the ImageNet dataset and use these inferences as labels to train other models and again evaluate the trained model …☆12Jun 21, 2021Updated 4 years ago
- image captioning with flikr8k dataset☆14Dec 7, 2021Updated 4 years ago
- ☆30Oct 2, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning☆2,892Jul 28, 2022Updated 3 years ago
- PyTorch implementation of image captioning with adaptive attention mechanism.☆18Mar 23, 2019Updated 7 years ago
- [TPAMI 2025] Implementation of "Exploring Frequency-Inspired Optimization in Transformer for Efficient Single Image Super-Resolution"☆15Mar 27, 2025Updated last year
- Gradient as Conditions: Rethinking HOG for All-in-one Image Restoration☆38Mar 22, 2026Updated last month
- [CVIU 2024] PPformer: Using pixel-wise and patch-wise cross-attention for low-light image enhancement☆13Oct 18, 2024Updated last year
- Text perturbation methods to evaluate the robustness of NLP models☆20Oct 6, 2021Updated 4 years ago
- A MaskGIT port from JAX to PyTorch☆18Jun 18, 2022Updated 3 years ago
- Latent optimal transport (LOT) for low rank transport and clustering☆20Jul 22, 2021Updated 4 years ago
- Code for the paper "Unsupervised Learning from Narrated Instruction Videos", CVPR2016☆20Jul 27, 2016Updated 9 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [NeurIPS 2025] Official repository for "ThermalGen: Style-Disentangled Flow-Based Generative Models for RGB-to-Thermal Image Translation"☆51Feb 12, 2026Updated 2 months ago
- A PyTorch implementation of the paper Show, Attend and Tell: Neural Image Caption Generation with Visual Attention☆87Oct 18, 2019Updated 6 years ago
- Using LSTM or Transformer to solve Image Captioning in Pytorch☆79Jul 20, 2021Updated 4 years ago
- Official repository for the paper "Random Shuffle Transformer for Image Restoration".☆17Jan 9, 2024Updated 2 years ago
- Deep Reinforcement Learning based Image Captioning with Embedding Reward☆26Aug 20, 2024Updated last year
- Machine Translation using Transfromers☆29Jan 1, 2020Updated 6 years ago
- Code for F-ViTA: Foundation Model Guided Visible to Thermal Translation☆33Jun 29, 2025Updated 10 months ago