alvinbhou / Video2TextLinks
📺 An Encoder-Decoder Model for Sequence-to-Sequence learning: Video to Text
☆25Updated 6 years ago
Alternatives and similar repositories for Video2Text
Users that are interested in Video2Text are comparing it to the libraries listed below
Sorting:
- Video Captioning is an encoder decoder mode based on sequence to sequence learning☆138Updated last year
- Efficient violence detection in surveillance videos using Human Skeletons and Motion Estimation☆50Updated last year
- 这是一个基于Pytorch平台、Transformer框架实现的视频描述生成 (Video Captioning) 深度学习模型。 视频描述生成任务指的是:输入一个视频,输出一句描述整个视频内容的文字(前提是视频较短且可以用一句话来描述)。本repo主要目的是帮助视力障碍…☆93Updated 3 years ago
- Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…☆610Updated 7 months ago
- ☆147Updated 3 years ago
- Video to Text: Natural language description generator for some given video. [Video Captioning]☆354Updated 3 years ago
- Transformer & CNN Image Captioning model in PyTorch.☆44Updated 2 years ago
- Implemented 3 different architectures to tackle the Image Caption problem, i.e, Merged Encoder-Decoder - Bahdanau Attention - Transformer…☆40Updated 4 years ago
- ☆70Updated 3 years ago
- Abnormal Human Behaviors Detection/ Road Accident Detection From Surveillance Videos/ Real-World Anomaly Detection in Surveillance Videos…☆165Updated 2 years ago
- Simple implementation of OpenAI CLIP model in PyTorch.☆700Updated last year
- Crime detection in cctv footage using deep learning☆94Updated 5 years ago
- Real-world Anomaly Detection in Surveillance Videos- pytorch Re-implementation☆134Updated 3 years ago
- Real Time Violence Detection using MobileNet and Bi-directional LSTM☆20Updated 2 years ago
- ☆50Updated 3 years ago
- ☆61Updated 4 years ago
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"☆977Updated last year
- A large scale video database for violence detection, which has 2,000 video clips containing violent or non-violent behaviours.☆445Updated last year
- Real-world Anomaly Detection in Surveillance Videos CVPR2018 UCF-Crime dataset☆132Updated 3 years ago
- Violence detection in videos using Deep Learning (CNNs + LSTMs). 98.5% video accuracy and 97.81% frame level accuracy (with threshold=3) …☆99Updated 3 years ago
- image captioning trained using COCO dataset in pytorch☆36Updated 5 years ago
- Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.☆73Updated 4 years ago
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)☆228Updated 2 years ago
- Image Captioning using CNN and Transformer.☆55Updated 3 years ago
- Computer Vision Project : Action Recognition on UCF101 Dataset☆38Updated 5 years ago
- Simple image-captioning model using Flickr8K dataset☆15Updated 3 years ago
- CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.☆117Updated 6 months ago
- [AAAI 2020] Official implementation of VAANet for Emotion Recognition☆79Updated last year
- This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"☆574Updated last year
- Pytorch version of - https://github.com/WaqasSultani/AnomalyDetectionCVPR2018☆190Updated last week