alvinbhou / Video2Text
📺 An Encoder-Decoder Model for Sequence-to-Sequence learning: Video to Text
☆26Updated 6 years ago
Alternatives and similar repositories for Video2Text
Users who are interested in Video2Text are comparing it to the libraries listed below.
- Video to Text: Natural language description generator for some given video. [Video Captioning]☆343Updated 3 years ago
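- This is a deep learning model for video captioning, implemented in PyTorch with the Transformer framework. The video captioning task takes a video as input and outputs one sentence describing the content of the whole video (provided the video is short and can be described in a single sentence). The main purpose of this repo is to help visually impaired…☆87Updated 3 years ago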
- The notebook explains the various steps to obtain the results of publication: "Is Space-Time Attention All You Need for Video Understandi…☆43Updated 4 years ago
- Video Captioning is an encoder-decoder model based on sequence-to-sequence learning☆136Updated last year
- ☆66Updated 4 years ago
- A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.☆71Updated last year
- Implementation of ViViT: A Video Vision Transformer☆533Updated 3 years ago
- ☆21Updated 2 years ago
- Code for selecting an action based on multimodal inputs. In this case, the inputs are voice and text.☆73Updated 3 years ago
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)☆227Updated 2 years ago
- SAM-SLR-v2 is an improved version of SAM-SLR for sign language recognition.☆34Updated 3 years ago
- [AAAI 2020] Official implementation of VAANet for Emotion Recognition☆78Updated last year
- PyTorch implementation of a collection of scalable Video Transformer Benchmarks.☆295Updated 3 years ago
- Code and pre-trained models for my submission to the ChaLearn 2021 LAP challenge.☆18Updated 2 years ago
- Continuous Sign Language Recognition with Correlation Network (CVPR 2023)☆120Updated 3 months ago
- fourierer / Video_Classification_ResNet3D_R2plus1D_ip-CSN_train-UCF101-HMDB51-Kinetics400-from-scratch: Using ResNet3D-50, R(2+1)D-50, and ip_CSN-50 to train on UCF-101, HMDB-51, and Kinetics-400 from scratch.☆28Updated 4 years ago
- Using VideoBERT to tackle video prediction☆125Updated 4 years ago
- This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as i…☆166Updated 5 years ago
- PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)☆144Updated 2 years ago
- Pytorch implementation of image captioning using transformer-based model.☆66Updated 2 years ago
- Transformer & CNN Image Captioning model in PyTorch.☆43Updated 2 years ago
- ☆45Updated 3 years ago
- Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks☆355Updated 9 months ago
- End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)☆219Updated last year
- Introduction to some sign language datasets suitable for research☆20Updated 4 years ago
- Release of the pretrained S3D Network in PyTorch (ECCV 2018)☆131Updated last year
- An Attention Based Approach to Sign Language Recognition | SOTA 2022 on WLASL Joints | https://arxiv.org/abs/2212.10746☆17Updated last week
- A repository for extracting CNN features from videos using PyTorch☆69Updated 2 years ago
- Tools for loading video datasets and applying transforms to video in PyTorch. You can directly load video files without preprocessing.☆69Updated 2 years ago
- CNN LSTM architecture implemented in Pytorch for Video Classification☆282Updated 2 years ago