Spareman / Action-recognition-BagOfWords-Early-Late-Fusion
Video classification using the UCF101 dataset for action recognition. We extract SIFT, MFCC and STIP features from the videos, we encode them using the Bag of Words framework and we implement early and late feature fusion using different combinations of the feature types available.
☆28Updated 4 years ago
Alternatives and similar repositories for Action-recognition-BagOfWords-Early-Late-Fusion
Users that are interested in Action-recognition-BagOfWords-Early-Late-Fusion are comparing it to the libraries listed below
Sorting:
- Vision basesd human action recognition using Two Stream LSTM☆8Updated 7 years ago
- Use human pose information to help action recognition, explored with attention-pooling method, C3D method and two-stream architecture, im…☆18Updated 6 years ago
- Implemented Sports Action Recognition system using SVM classifier. First, extracted the features from each video frame from each categor…☆12Updated 5 years ago
- Learning Spatiotemporal Features using 3DCNN and Convolutional LSTM for Gesture Recognition☆61Updated 6 years ago
- ☆25Updated 7 years ago
- A public Python implementation for generating Dynamic Images introduced in 'Dynamic Image Networks for Action Recognition' by Bilen et a…☆39Updated 4 years ago
- Implementation of Action Recognition using 3D Convnet on UCF-101 dataset.☆74Updated 6 years ago
- Use 3D ResNet to extract features of UCF101 and HMDB51 and then classify them.☆42Updated 6 years ago
- part-aware lstm implemented in tensorflow used in skeleton-based action recognition with dataset NTU RGB+D.☆48Updated 5 years ago
- ☆45Updated 5 years ago
- Multi 3DCNN for action recognition using global and local information☆38Updated 7 years ago
- ☆91Updated 6 years ago
- Video Action Classification Using Spatial Temporal Clues. Original paper: arXiv:1504.01561☆23Updated 6 years ago
- Video classification tools using 3D ResNet☆23Updated 7 years ago
- Code for Group-Level Emotion Recognition Using Hybrid Deep Models Based on Faces, Scenes, Skeletons and Visual Attentions☆17Updated 6 years ago
- Key frames extraction in traffic videos using K-Means☆13Updated 6 years ago
- code for Emotion Recognition in the Wild (EmotiW) challenge☆38Updated 6 years ago
- An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three ne…☆29Updated 6 years ago
- Action Recognition in Videos using Stacked Optical Flow and HOGHOF features.☆11Updated 8 years ago
- Multi-stream CNN architectures for action detection with actor-centric filtering☆26Updated 6 years ago
- Action recognition network -- CNN + LSTM.☆75Updated 6 years ago
- Train action classification model based on individual frames☆41Updated 6 years ago
- ☆13Updated 3 years ago
- Our implementation of Recurrent Pose Attention in Du et al.: "RPAN: An End-to-End Recurrent Pose-attention Network for Action Recognition…☆37Updated 6 years ago
- Source code of our TCSVT 2018 paper "Two-stream Collaborative Learning with Spatial-Temporal Attention for Video Classification".☆20Updated 7 years ago
- An implementation of the paper "Skeleton-based abnormal gait detection"☆9Updated 7 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆21Updated 2 years ago
- Major refactor works ongoing☆14Updated 5 years ago
- Code for paper: Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition☆28Updated 2 years ago
- Two-stream CNNs for video action recognition implemented in Keras☆122Updated 5 years ago