alvinbhou / Video2TextLinks
📺 An Encoder-Decoder Model for Sequence-to-Sequence learning: Video to Text
☆25Updated 6 years ago
Alternatives and similar repositories for Video2Text
Users that are interested in Video2Text are comparing it to the libraries listed below
Sorting:
- Efficient violence detection in surveillance videos using Human Skeletons and Motion Estimation☆49Updated last year
- 这是一个基于Pytorch平台、Transformer框架实现的视频描述生成 (Video Captioning) 深度学习模型。 视频描述生成任务指的是:输入一个视频,输出一句描述整个视频内容的文字(前提是视频较短且可以用一句话来描述)。本repo主要目的是帮助视力障碍…☆88Updated 3 years ago
- Real Time Violence Detection using MobileNet and Bi-directional LSTM☆18Updated 2 years ago
- ☆63Updated 3 years ago
- Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.☆73Updated 4 years ago
- Video Captioning is an encoder decoder mode based on sequence to sequence learning☆136Updated last year
- Video to Text: Natural language description generator for some given video. [Video Captioning]☆345Updated 3 years ago
- ☆51Updated 3 years ago
- Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…☆596Updated 4 months ago
- Violence Detection using 3D Convolutional Neural Networks☆71Updated 5 years ago
- A PyTorch Implementation of PGL-SUM from "Combining Global and Local Attention with Positional Encoding for Video Summarization" (IEEE IS…☆89Updated 2 years ago
- Deep learning based algorithm which is capable of detecting violence in indoor or outdoor environments: fight, fire or car crash and even…☆80Updated 5 months ago
- Violence detection in videos using Deep Learning (CNNs + LSTMs). 98.5% video accuracy and 97.81% frame level accuracy (with threshold=3) …☆98Updated 3 years ago
- A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.☆70Updated last year
- A jupyter notebook showing how to finetune the vision transformer on a facial expression dataset (FER-2013)☆33Updated 3 years ago
- Abnormal Human Behaviors Detection/ Road Accident Detection From Surveillance Videos/ Real-World Anomaly Detection in Surveillance Videos…☆164Updated 2 years ago
- A human violence detection & classification system using recurrent neural networks(RNN).☆40Updated last year
- Where is the emotion? Dissecting a multi-gap network for image emotion classification☆10Updated 4 years ago
- Transformer & CNN Image Captioning model in PyTorch.☆44Updated 2 years ago
- Sign Language Alphabet Detection and Recognition using YOLOv8☆46Updated 2 years ago
- Crime detection in cctv footage using deep learning☆90Updated 5 years ago
- ☆61Updated 4 years ago
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)☆227Updated 2 years ago
- Code for the paper: "Efficient Two-Stream Network for Violence Detection Using Separable Convolutional LSTM"☆60Updated last year
- Pytorch implementation of DSR-RL for Video Summarization Task☆11Updated 3 years ago
- This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as i…☆166Updated 5 years ago
- Real-world Anomaly Detection in Surveillance Videos CVPR2018 UCF-Crime dataset☆130Updated 3 years ago
- pre-trained model and source code for generate description of images.☆27Updated 4 years ago
- A curated list of deep learning resources for video-text retrieval.☆623Updated last year
- Image Captioning using CNN and Transformer.☆53Updated 3 years ago