Spareman / Action-recognition-BagOfWords-Early-Late-Fusion
Video classification using the UCF101 dataset for action recognition. We extract SIFT, MFCC and STIP features from the videos, encode them using the Bag of Words framework, and implement early and late feature fusion using different combinations of the available feature types.
☆28Updated 4 years ago
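The description above names three steps: Bag of Words encoding, early fusion, and late fusion. The following is a minimal sketch of those ideas, not the repository's actual code. It assumes a codebook is already available (in practice it is commonly learned with k-means over training descriptors), uses random arrays in place of real SIFT and MFCC descriptors, and uses made-up class scores for the late-fusion step. All function names and sizes are hypothetical.

```python
import numpy as np

def bow_histogram(descriptors, codebook):
    """Assign each local descriptor to its nearest codeword and return
    an L1-normalised histogram of codeword counts."""
    # Pairwise squared Euclidean distances, shape (n_descriptors, n_words).
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    words = d2.argmin(axis=1)
    hist = np.bincount(words, minlength=len(codebook)).astype(float)
    return hist / max(hist.sum(), 1.0)

def early_fusion(histograms):
    """Early fusion: concatenate per-modality histograms into one vector
    that a single classifier would consume."""
    return np.concatenate(histograms)

def late_fusion(class_scores, weights=None):
    """Late fusion: combine per-modality class scores (one classifier per
    modality) with an optionally weighted average."""
    scores = np.stack(class_scores)
    return np.average(scores, axis=0, weights=weights)

# Toy data standing in for real SIFT / MFCC descriptors (hypothetical sizes).
rng = np.random.default_rng(0)
sift_desc, sift_codebook = rng.normal(size=(50, 128)), rng.normal(size=(8, 128))
mfcc_desc, mfcc_codebook = rng.normal(size=(30, 13)), rng.normal(size=(4, 13))

sift_hist = bow_histogram(sift_desc, sift_codebook)
mfcc_hist = bow_histogram(mfcc_desc, mfcc_codebook)

# Early fusion: one feature vector of length 8 + 4 = 12.
fused_vector = early_fusion([sift_hist, mfcc_hist])

# Late fusion: made-up per-modality class probabilities for 3 classes.
fused_scores = late_fusion([np.array([0.7, 0.2, 0.1]),
                            np.array([0.3, 0.6, 0.1])])
predicted_class = int(fused_scores.argmax())
```

In this sketch, early fusion hands a single concatenated vector to one classifier, while late fusion trains one classifier per feature type and merges their outputs afterwards; which works better depends on the features and data.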
Alternatives and similar repositories for Action-recognition-BagOfWords-Early-Late-Fusion
Users that are interested in Action-recognition-BagOfWords-Early-Late-Fusion are comparing it to the libraries listed below
- Implemented Sports Action Recognition system using SVM classifier. First, extracted the features from each video frame from each categor…☆12Updated 5 years ago
- Train the C3D network with my own data set. Video or gif can be supported as a training file. Video streams or image frames can be used a…☆15Updated 5 years ago
- Key frames extraction in traffic videos using K-Means☆13Updated 7 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆27Updated 5 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆21Updated 2 years ago
- Major refactor works ongoing☆14Updated 6 years ago
- menovideo: pytorch library for video action recognition and video understanding☆29Updated 3 years ago
- Action Recognition based on Pose Estimation.☆27Updated 4 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Updated 4 years ago
- Bilinear CNNs in PyTorch☆20Updated 5 years ago
- Video classification tools using 3D ResNet☆24Updated 7 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 4 years ago
- Vision based human action recognition using Two Stream LSTM☆8Updated 7 years ago
- Some scripts used for action recognition on UCF101 dataset☆11Updated 9 years ago
- Action recognition network -- CNN + LSTM.☆75Updated 7 years ago
- Learning Spatiotemporal Features using 3DCNN and Convolutional LSTM for Gesture Recognition☆61Updated 6 years ago
- Multi 3DCNN for action recognition using global and local information☆38Updated 7 years ago
- ☆25Updated 8 years ago
- TensorFlow implementation for video classification.☆41Updated 7 years ago
- ☆13Updated 3 years ago
- A public Python implementation for generating Dynamic Images introduced in 'Dynamic Image Networks for Action Recognition' by Bilen et a…☆39Updated 5 years ago
- Real-time fall detection using two-stream convolutional neural net (CNN) with Motion History Image (MHI)☆64Updated 2 years ago
- An improvement on the paper: Learning to Detect Violent Videos using Convolution LSTM☆11Updated 5 years ago
- Source code for abnormal detection on MIT video surveillance dataset using Nonnegative Matrix Factorization☆11Updated 5 years ago
- Multi-Label Multi-Class Fine-Grain Image-Classification using Keras for iMaterialist_challenge_FGVC5 at CVPR18☆14Updated 7 years ago
- Multimodal short video classification task, integrating video, image, audio and text modes for short video classification☆19Updated 5 years ago
- This Pytorch repo uses BiConvLSTM in a Spatiotemporal Encoder to detect violence in Videos. Three benchmark datasets namely Hockey, Movie…☆37Updated 6 years ago
- Action Recognition on the KTH Dataset☆54Updated 7 years ago
- ☆11Updated 6 years ago
- Action Recognition in Video Sequences using Deep Bi-directional LSTM with CNN Features☆46Updated 2 years ago