S2VT (seq2seq) video captioning with Bahdanau and Luong attention, implemented in TensorFlow
☆18 · Apr 26, 2018 · Updated 8 years ago
Alternatives and similar repositories for S2VT-seq2seq-video-captioning-attention
Users who are interested in S2VT-seq2seq-video-captioning-attention are comparing it to the libraries listed below.
- Notes for Machine Learning and Having It Deep and Structured (Hung-yi Lee) ☆14 · Sep 4, 2018 · Updated 7 years ago
- Video to Language Challenge (MSR-VTT Challenge 2016) ☆32 · Dec 28, 2017 · Updated 8 years ago
- Study of frame rate effects on MSR-VTT dataset ☆14 · Feb 10, 2018 · Updated 8 years ago
- Machine learning and deep structure ☆14 · Jul 30, 2018 · Updated 7 years ago
- ☆14 · Jan 7, 2018 · Updated 8 years ago
- Implementation of TDConvED for video captioning ☆13 · Mar 18, 2020 · Updated 6 years ago
- A video captioning tool using S2VT method and attention mechanism (TensorFlow) ☆15 · Oct 14, 2018 · Updated 7 years ago
- S2VT PyTorch implementation ☆20 · Jun 28, 2019 · Updated 6 years ago
- Social distance monitoring using OpenCV and the YOLO object detector ☆11 · Jul 24, 2020 · Updated 5 years ago
- ☆15 · Jul 9, 2019 · Updated 6 years ago
- Caffe ☆205 · Oct 9, 2017 · Updated 8 years ago
- Code to accompany the paper "Learning Grimaces By Watching TV" and FaceValue dataset ☆12 · Aug 4, 2018 · Updated 7 years ago
- Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training" ☆66 · Apr 18, 2019 · Updated 7 years ago
- TensorRT deployment tutorial ☆11 · Aug 1, 2025 · Updated 10 months ago
- PyTorch implementation of audio-visual fusion video captioning model ☆27 · Jul 26, 2018 · Updated 7 years ago
- Implementation of our paper "Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation". Accepted in EACL … ☆11 · May 22, 2023 · Updated 3 years ago
- Anomaly detection using TensorFlow, Keras, and OpenCV ☆12 · Feb 18, 2024 · Updated 2 years ago
- UCFCrime Annotation ☆20 · Jan 16, 2020 · Updated 6 years ago
- Face recognition with VGG face net in TensorFlow and Keras (Python). Trained in Colab. ☆15 · Oct 15, 2019 · Updated 6 years ago
- ☆192 · Jun 16, 2025 · Updated 11 months ago
- Machine Learning and having it Deep and Structured (MLDS) in 2018 spring ☆146 · Apr 19, 2019 · Updated 7 years ago
- ☆15 · Apr 18, 2023 · Updated 3 years ago
- Using Semantic Compositional Networks for Video Captioning ☆96 · Nov 27, 2018 · Updated 7 years ago
- Official code for "Mean Shift for Self-Supervised Learning" ☆56 · Oct 12, 2021 · Updated 4 years ago
- [ECCV2022] 3D-PL: Domain Adaptive Depth Estimation with 3D-aware Pseudo-Labeling ☆17 · Sep 20, 2022 · Updated 3 years ago
- Source code from the paper "Reconstruction of Panoramic Dental Images Through Bézier Function Optimization" ☆17 · Feb 27, 2021 · Updated 5 years ago
- Reading list for multimodal sequence learning ☆14 · Sep 4, 2023 · Updated 2 years ago
- Egocentric Video Description based on Temporally-Linked Sequences ☆11 · Jul 17, 2017 · Updated 8 years ago
- Frozen Pretrained Transformers for Neural Sign Language Translation ☆15 · Apr 23, 2022 · Updated 4 years ago
- ☆11 · Oct 5, 2020 · Updated 5 years ago
- Video captioning using 3DCNN and LSTM (PyTorch) ☆11 · Sep 26, 2019 · Updated 6 years ago
- This is the official code repository for the paper 'Cross-modality Data Augmentation for End-to-End Sign Language Translation'. Accepted… ☆16 · Oct 18, 2023 · Updated 2 years ago
- ☆12 · Mar 30, 2022 · Updated 4 years ago
- Activity Recognition using Temporal Optical Flow Convolutional Features and Multi-Layer LSTM ☆25 · Jul 27, 2025 · Updated 10 months ago
- CatNet: Class Incremental 3D ConvNets for Lifelong Egocentric Gesture Recognition ☆12 · Apr 21, 2020 · Updated 6 years ago
- Code for Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization. ☆10 · Sep 28, 2021 · Updated 4 years ago
- This repository reimplements the "Show, Attend and Tell" model and adds extra deep learning techniques. ☆12 · Oct 3, 2023 · Updated 2 years ago
- PyTorch implementation of video captioning ☆401 · Aug 19, 2019 · Updated 6 years ago
- Human activity recognition (LSTM, BidLSTM, BidLSTM+CNN, LSTM+CNN) ☆16 · Mar 6, 2018 · Updated 8 years ago