Image Captioning with CNN, LSTM and RNN using PyTorch on COCO Dataset
☆18 · Mar 8, 2020 · Updated 6 years ago
Alternatives and similar repositories for image_captioning
Users that are interested in image_captioning are comparing it to the libraries listed below.
- ☆12 · May 3, 2024 · Updated 2 years ago
- ☆15 · Feb 6, 2025 · Updated last year
- White-box Fairness Testing through Adversarial Sampling ☆14 · Apr 16, 2021 · Updated 5 years ago
- The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021) ☆16 · Jan 2, 2023 · Updated 3 years ago
- ☆20 · May 3, 2025 · Updated last year
- Machine Learning Operations with a denoising diffusion model using a butterfly dataset ☆11 · Jun 2, 2024 · Updated 2 years ago
- A PyTorch-based library to evaluate learning methods on small image classification datasets ☆18 · Jun 22, 2022 · Updated 3 years ago
- Implementation of CarSNN: An Efficient Spiking Neural Network for Event-Based Autonomous Cars on the Loihi Neuromorphi ☆15 · Aug 4, 2021 · Updated 4 years ago
- This repository supports the BENDER series of videos for the MICCAI Education Challenge, 2022. ☆15 · May 13, 2026 · Updated last month
- [CVPR 2025] This is the official source for our paper "DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations" ☆62 · Jul 12, 2025 · Updated 11 months ago
- The official SpeakerVid-5M data curation code. ☆76 · Jul 23, 2025 · Updated 10 months ago
- Eagleeye: fast sub-net evaluation for efficient neural network pruning with Tensorflow keras ☆18 · Mar 27, 2021 · Updated 5 years ago
- 3 Minutes Machine Learning ☆16 · Dec 8, 2022 · Updated 3 years ago
- ☆30 · Aug 14, 2023 · Updated 2 years ago
- Image captioning with the Flickr8k dataset ☆14 · Dec 7, 2021 · Updated 4 years ago
- Digital human code based on MuseTalk. ☆34 · Sep 14, 2024 · Updated last year
- This method performs 3D object detection in the BEV space using images from multiple cameras. ☆31 · Oct 26, 2022 · Updated 3 years ago
- ☆27 · Jun 17, 2022 · Updated 3 years ago
- (TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information ☆33 · Dec 26, 2024 · Updated last year
- Deploy Python FastAPI serverless application on Azure Functions ☆20 · May 31, 2024 · Updated 2 years ago
- CIRCLe: Color Invariant Representation Learning for Unbiased Classification of Skin Lesions ☆19 · Apr 17, 2023 · Updated 3 years ago
- This is a deep learning project using the Flickr8k dataset for CSE 475: Machine Learning ☆17 · Jun 26, 2021 · Updated 4 years ago
- Notes about Computer vision and implementation of image-processing, face-detection, face-recognition, and character optical recognition a… ☆17 · Sep 5, 2022 · Updated 3 years ago
- Built and deployed scalable LLM retrieval APIs on a hybrid GCP architecture with full CI/CD, IaC, and monitoring ☆81 · Aug 10, 2025 · Updated 10 months ago
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval ☆66 · Jun 19, 2024 · Updated last year
- Person Detection using the EfficientNet B0 and Light Head RCNN running at 12 FPS ☆24 · Sep 20, 2019 · Updated 6 years ago
- HistoClean is a tool for the preprocessing and augmentation of images used in deep learning models. This easy to use application brings… ☆32 · Feb 18, 2022 · Updated 4 years ago
- Production Machine Learning Pipeline for Text Classification with fastText ☆33 · Jun 10, 2021 · Updated 5 years ago
- Trained a Multi-Layer Perceptron, AlexNet and pre-trained InceptionV3 architectures on NVIDIA GPUs to classify Brain MRI images into meni… ☆34 · Oct 6, 2022 · Updated 3 years ago
- Pure python implementation of unsupervised MNIST classification using Spiking Neural Networks (using STDP) ☆31 · Mar 28, 2022 · Updated 4 years ago
- ☆46 · May 5, 2023 · Updated 3 years ago
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers. ☆51 · Jul 10, 2023 · Updated 2 years ago
- Data generation using GAN on our own data ☆34 · Aug 15, 2021 · Updated 4 years ago
- [CVPR2023] Discrete Point-wise Attack Is Not Enough: Generalized Manifold Adversarial Attack for Face Recognition ☆40 · May 30, 2023 · Updated 3 years ago
- Pre-processing NSL-KDD dataset using Data mining techniques. Algorithm written in python to detect the attacks in NSL KDD dataset. ☆27 · Jan 6, 2020 · Updated 6 years ago
- The official implementation of UFPMP-Det ☆67 · May 23, 2022 · Updated 4 years ago
- wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech ☆94 · Jul 9, 2025 · Updated 11 months ago
- Official implementation of "ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing" ☆76 · Sep 20, 2023 · Updated 2 years ago
- A collection of self-supervised papers in medical imaging. ☆40 · Mar 16, 2021 · Updated 5 years ago