Video classification using the UCF101 dataset for action recognition. We extract SIFT, MFCC and STIP features from the videos, we encode them using the Bag of Words framework and we implement early and late feature fusion using different combinations of the feature types available.
☆30Dec 12, 2020Updated 5 years ago
Alternatives and similar repositories for Action-recognition-BagOfWords-Early-Late-Fusion
Users that are interested in Action-recognition-BagOfWords-Early-Late-Fusion are comparing it to the libraries listed below
Sorting:
- The project aims to improve the accuracy of target recognition through multi-feature fusion.Including manual feature extraction, deep lea…☆11Feb 18, 2020Updated 6 years ago
- Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow☆26Nov 23, 2018Updated 7 years ago
- Implemented Sports Action Recognition system using SVM classifier. First, extracted the features from each video frame from each categor…☆14Oct 15, 2019Updated 6 years ago
- Contains code for C3D, LCN and TSM for action recognition models.☆10May 31, 2020Updated 5 years ago
- This project implements a state-of-the-art skin cancer classification system that combines Convolutional Neural Networks (EfficientNetV2,…☆12Mar 2, 2026Updated last week
- Feature level fusion of LBP and Gabor☆17Jun 1, 2015Updated 10 years ago
- Image Feature Descriptors☆26Feb 15, 2013Updated 13 years ago
- Experiment on QnA tabular data using LLMs and SQL☆27Oct 24, 2024Updated last year
- This project is designed to classify human action recognition datasets with a CNN + LSTM model.☆18May 12, 2019Updated 6 years ago
- An efficient and multi-scale feature fusion behavior recognition algorithm☆19Jul 20, 2020Updated 5 years ago
- Implementation of CycleGAN for unsupervised image segmentaion, performed on brain tumor scans☆49Aug 22, 2019Updated 6 years ago
- SUPERVAIZER is a toolkit built for the age of AI interoperability. At its core, it implements Google's Agent-to-Agent (A2A) protocol, ena…☆14Feb 4, 2026Updated last month
- ☆23Mar 5, 2018Updated 8 years ago
- Evaluation code for CAMELYON16 challenge.☆27Oct 18, 2017Updated 8 years ago
- Simple vs complex temporal recurrences for video saliency prediction (BMVC 2019)☆26Nov 22, 2022Updated 3 years ago
- PyTorch Implementation for Global and Local Attention Network☆23Aug 11, 2020Updated 5 years ago
- ☆29Jun 24, 2021Updated 4 years ago
- This project is an AI Recruitment System designed to accelerate the hiring process for HR and technical recruiters.☆14Jan 3, 2025Updated last year
- Supervoxel-based Segmentation of 3D Volumetric Images☆11Feb 6, 2018Updated 8 years ago
- MultiResUNet implementation in PyTorch; MultiResUNet: Rethinking the U-Net Architecture for Multimodal☆29Jul 17, 2020Updated 5 years ago
- A comprehensive Python and R-based toolkit for clustering and sorting electrophysiology data recorded using Intan RHD2132 chips. Original…☆10Updated this week
- A multimodal live AI assistant designed to enhance the browsing experience using Gemini.☆11Feb 15, 2025Updated last year
- ☆10Jul 12, 2022Updated 3 years ago
- ESG Insights AI simplifies ESG data analysis with advanced AI models, ensuring compliance with GRI standards. It helps asset managers ass…☆13Oct 31, 2024Updated last year
- ☆13Mar 8, 2022Updated 4 years ago
- High-throughput PET image reconstruction with high quantitative accuracy and precision☆35Dec 18, 2025Updated 2 months ago
- ☆10Dec 21, 2022Updated 3 years ago
- ☆10Sep 25, 2018Updated 7 years ago
- Self hosted AI workflow for scraping Instagram Reels (audio and description). Extracting, summarising and categorising, then storing all …☆28Jan 10, 2026Updated last month
- Implementation of NAACL'19 Strong and Simple Baselines for Multimodal Utterance Embeddings☆10Jun 4, 2019Updated 6 years ago
- Pytorch implementation of Generative Adversarial Networks (GAN) for ULTRASOUND image.☆13Sep 12, 2018Updated 7 years ago
- A rules induction system for data mining and exploratory data analysis☆11Jul 17, 2024Updated last year
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Simple model for sentence compression (a.k.a Baseline in Klerke et al., NAACL 2016)☆10Dec 16, 2018Updated 7 years ago
- ☆10Feb 12, 2020Updated 6 years ago
- Full stack data-science project☆12Jan 13, 2022Updated 4 years ago
- An AI-powered tool that translates plain English commands into multi-step API workflows, automating the entire testing process.☆17Jul 27, 2025Updated 7 months ago
- 2D Fused LASSO using Gradient Descent for grayscale image restoration 🎈☆10Jan 24, 2019Updated 7 years ago
- Whole Heart MRI Segmenter based on data from HVSMR MICCAI 2016 Challenge☆11Apr 25, 2020Updated 5 years ago