Spareman / Action-recognition-BagOfWords-Early-Late-Fusion
Video classification using the UCF101 dataset for action recognition. We extract SIFT, MFCC and STIP features from the videos, we encode them using the Bag of Words framework and we implement early and late feature fusion using different combinations of the feature types available.
☆28Updated 4 years ago
Alternatives and similar repositories for Action-recognition-BagOfWords-Early-Late-Fusion:
Users that are interested in Action-recognition-BagOfWords-Early-Late-Fusion are comparing it to the libraries listed below
- Use human pose information to help action recognition, explored with attention-pooling method, C3D method and two-stream architecture, im…☆18Updated 6 years ago
- Implemented Sports Action Recognition system using SVM classifier. First, extracted the features from each video frame from each categor…☆12Updated 5 years ago
- Learning Spatiotemporal Features using 3DCNN and Convolutional LSTM for Gesture Recognition☆61Updated 6 years ago
- Action recognition network -- CNN + LSTM.☆75Updated 6 years ago
- Implementation of Action Recognition using 3D Convnet on UCF-101 dataset.☆74Updated 6 years ago
- Human activity recognition(LSTM, BidLSTM, BidLSTM+CNN, LSTM+CNN)☆15Updated 7 years ago
- Use 3D ResNet to extract features of UCF101 and HMDB51 and then classify them.☆42Updated 6 years ago
- ☆90Updated 6 years ago
- ☆25Updated 7 years ago
- part-aware lstm implemented in tensorflow used in skeleton-based action recognition with dataset NTU RGB+D.☆48Updated 5 years ago
- Major refactor works ongoing☆14Updated 5 years ago
- Vision basesd human action recognition using Two Stream LSTM☆8Updated 7 years ago
- Train the C3D network with my own data set. Video or gif can be supported as a training file. Video streams or image frames can be used a…☆15Updated 5 years ago
- PyTorch Implementation for Global and Local Attention Network☆24Updated 4 years ago
- Source code of our TCSVT 2018 paper "Two-stream Collaborative Learning with Spatial-Temporal Attention for Video Classification".☆20Updated 7 years ago
- an improvement of the paper: Learning to Detect Violent Videos using Convolution LSTM☆11Updated 4 years ago
- ☆11Updated 6 years ago
- An implementation of the paper "Skeleton-based abnormal gait detection"☆9Updated 6 years ago
- This Pytorch repo uses BiConvLSTM in a Spatiotemporal Encoder to detect violence in Videos. Three benchmark datasets namely Hockey, Movie…☆37Updated 6 years ago
- Our implementation of Recurrent Pose Attention in Du et al.: "RPAN: An End-to-End Recurrent Pose-attention Network for Action Recognition…☆37Updated 6 years ago
- Video classification tools using 3D ResNet☆23Updated 7 years ago
- Two-stream CNNs for video action recognition implemented in Keras☆122Updated 5 years ago
- Emotional Video to Audio Transformation with ANFIS-DeepRNN (Vanilla RNN and LSTM-DeepRNN) [MPE 2020]☆25Updated 4 years ago
- Deep learning model that predicts human action in a given video feed using pose estimation☆22Updated 6 years ago
- Action Recognition in Video Sequences using Deep Bi-directional LSTM with CNN Features☆45Updated last year
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆25Updated 4 years ago
- Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow☆26Updated 6 years ago
- A public Python implementation for generating Dynamic Images introduced in 'Dynamic Image Networks for Action Recognition' by Bilen et a…☆39Updated 4 years ago
- ☆13Updated 3 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆20Updated 2 years ago