Spareman / Action-recognition-BagOfWords-Early-Late-FusionLinks
Video classification using the UCF101 dataset for action recognition. We extract SIFT, MFCC and STIP features from the videos, we encode them using the Bag of Words framework and we implement early and late feature fusion using different combinations of the feature types available.
☆30Updated 5 years ago
Alternatives and similar repositories for Action-recognition-BagOfWords-Early-Late-Fusion
Users that are interested in Action-recognition-BagOfWords-Early-Late-Fusion are comparing it to the libraries listed below
Sorting:
- Key frames extraction in traffic videos using K-Means☆13Updated 7 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆30Updated 5 years ago
- ☆11Updated 7 years ago
- Train the C3D network with my own data set. Video or gif can be supported as a training file. Video streams or image frames can be used a…☆16Updated 6 years ago
- Learning Spatiotemporal Features using 3DCNN and Convolutional LSTM for Gesture Recognition☆63Updated 7 years ago
- Action recognition using skeleton information based on HMM model☆36Updated 11 years ago
- ☆13Updated 4 years ago
- A public Python implementation for generating Dynamic Images introduced in 'Dynamic Image Networks for Action Recognition' by Bilen et a…☆40Updated 5 years ago
- Implemented Sports Action Recognition system using SVM classifier. First, extracted the features from each video frame from each categor…☆14Updated 6 years ago
- Major refactor works ongoing☆14Updated 6 years ago
- ☆25Updated 8 years ago
- Action recognition network -- CNN + LSTM.☆75Updated 7 years ago
- Convolutional Neural Networks (CNNs) are being widely used for various tasks in Computer Vision. We focus on the task of image classifica…☆25Updated 7 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Updated 5 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆21Updated 2 years ago
- Source code of our TCSVT 2018 paper "Two-stream Collaborative Learning with Spatial-Temporal Attention for Video Classification".☆20Updated 7 years ago
- A tutorial for using deep learning for activity recognition (Pytorch and Tensorflow)☆231Updated 4 years ago
- menovideo: pytorch library for video action recognition and video understanding☆29Updated 4 years ago
- Attention Based Multi-modal Emotion Recognition; Stanford Emotional Narratives Dataset☆17Updated 6 years ago
- part-aware lstm implemented in tensorflow used in skeleton-based action recognition with dataset NTU RGB+D.☆49Updated 5 years ago
- This Pytorch repo uses BiConvLSTM in a Spatiotemporal Encoder to detect violence in Videos. Three benchmark datasets namely Hockey, Movie…☆37Updated 7 years ago
- Implementation of Action Recognition using 3D Convnet on UCF-101 dataset.☆75Updated 7 years ago
- Code for Group-Level Emotion Recognition Using Hybrid Deep Models Based on Faces, Scenes, Skeletons and Visual Attentions☆17Updated 7 years ago
- 🤗 Facial expression recognition with Pytorch (My First Project in 2018)☆77Updated 7 years ago
- Use 3D ResNet to extract features of UCF101 and HMDB51 and then classify them.☆44Updated 7 years ago
- Human activity recognition(LSTM, BidLSTM, BidLSTM+CNN, LSTM+CNN)☆16Updated 7 years ago
- Action Recognition in Video Sequences using Deep Bi-directional LSTM with CNN Features☆46Updated 2 years ago
- Course Project for CS763 Computer Vision IIT Bombay☆34Updated 7 years ago
- TensorFlow implementation for video classification.☆44Updated 7 years ago
- Multimodal Gesture Recognition Using 3D Convolution and Convolutional LSTM☆93Updated 7 years ago