Spareman / Action-recognition-BagOfWords-Early-Late-FusionLinks
Video classification using the UCF101 dataset for action recognition. We extract SIFT, MFCC and STIP features from the videos, we encode them using the Bag of Words framework and we implement early and late feature fusion using different combinations of the feature types available.
☆28Updated 4 years ago
Alternatives and similar repositories for Action-recognition-BagOfWords-Early-Late-Fusion
Users that are interested in Action-recognition-BagOfWords-Early-Late-Fusion are comparing it to the libraries listed below
Sorting:
- Major refactor works ongoing☆14Updated 6 years ago
- Train the C3D network with my own data set. Video or gif can be supported as a training file. Video streams or image frames can be used a…☆15Updated 6 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆21Updated 2 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Updated 5 years ago
- Implemented Sports Action Recognition system using SVM classifier. First, extracted the features from each video frame from each categor…☆12Updated 5 years ago
- Learning Spatiotemporal Features using 3DCNN and Convolutional LSTM for Gesture Recognition☆62Updated 6 years ago
- ☆13Updated 3 years ago
- Video classification tools using 3D ResNet☆24Updated 7 years ago
- An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three ne…☆29Updated 6 years ago
- ☆25Updated 8 years ago
- menovideo: pytorch library for video action recognition and video understanding☆29Updated 3 years ago
- Source code for abnormal detection on MIT video surveillance dataset using Nonnegative Matrix Factorization☆11Updated 5 years ago
- Multimodal short video classification task, integrating video, image, audio and text modes for short video classification☆19Updated 5 years ago
- A public Python implementation for generating Dynamic Images introduced in 'Dynamic Image Networks for Action Recognition' by Bilen et a…☆39Updated 5 years ago
- ☆11Updated 7 years ago
- Bilinear CNNs in PyTorch☆20Updated 5 years ago
- Action recognition network -- CNN + LSTM.☆75Updated 7 years ago
- Deep learning model that predicts human action in a given video feed using pose estimation☆22Updated 6 years ago
- Repository for 2019 CVPR AI City Challenge Track 3 from IPL@UW☆12Updated 6 years ago
- Key frames extraction in traffic videos using K-Means☆13Updated 7 years ago
- I3D implemetation in Keras + video preprocessing + visualization of results☆42Updated last year
- FUKinect-Fall dataset was created using Kinect V1. The dataset includes walking, bending, sitting, squatting, lying and falling actions p…☆26Updated 9 months ago
- Codes for Category-aware Generative Adversarial Networks (AAAI 2020)☆18Updated 4 years ago
- Frame level anomaly detection and localization in videos using auto-encoders☆64Updated 3 years ago
- Convolutional Neural Networks (CNNs) are being widely used for various tasks in Computer Vision. We focus on the task of image classifica…☆25Updated 7 years ago
- Action recognition using skeleton information based on HMM model☆36Updated 11 years ago
- This Pytorch repo uses BiConvLSTM in a Spatiotemporal Encoder to detect violence in Videos. Three benchmark datasets namely Hockey, Movie…☆37Updated 6 years ago
- Action Recognition in Video Sequences using Deep Bi-directional LSTM with CNN Features☆46Updated 2 years ago
- Course Project for CS763 Computer Vision IIT Bombay☆34Updated 7 years ago
- Attention Based Multi-modal Emotion Recognition; Stanford Emotional Narratives Dataset☆17Updated 6 years ago