An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.
☆29May 1, 2019Updated 6 years ago
Alternatives and similar repositories for Multimodal-Gesture-Recognition-with-LSTMs-and-CTC
Users that are interested in Multimodal-Gesture-Recognition-with-LSTMs-and-CTC are comparing it to the libraries listed below
Sorting:
- Implement Real-Time Gesture Recognition and Segmentation with Mask RCNN☆11Jan 16, 2019Updated 7 years ago
- Learning Spatiotemporal Features using 3DCNN and Convolutional LSTM for Gesture Recognition☆62Dec 6, 2018Updated 7 years ago
- Starter project for the Kaggle State Farm Distracted Driver Detection Competition☆24Jul 7, 2017Updated 8 years ago
- PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)☆26Mar 5, 2021Updated 5 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Nov 19, 2022Updated 3 years ago
- A2B Neural Rendering of Ambisonic Recordings to Binaural☆18Aug 5, 2025Updated 7 months ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- We Need No Pixels: Video Manipulation Detection Using Stream Descriptors☆10Oct 4, 2019Updated 6 years ago
- 基于Python的开源量化交易平台开发框架☆10Feb 5, 2020Updated 6 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- [JBHI 2024] HierAttn: Deeply Supervised Skin Lesions Diagnosis with Stage and Branch Attention☆11Nov 16, 2024Updated last year
- The repository is created to support a Capstone project on the topic of "Study and Implementation of Sound Source Localization Techniques…☆13Apr 27, 2021Updated 4 years ago
- Various ops for handling several entities in a document, perform anaphora resolution, clustering, etc.☆12Dec 8, 2022Updated 3 years ago
- Bag-of-features image classification using OpenCV☆12Sep 25, 2013Updated 12 years ago
- Uses Node.js and Leap Motion to control an AR Drone and stream video to the browser.☆62Nov 20, 2013Updated 12 years ago
- Rule-Based Thyroid Whole Slide Image Diagnosis☆10Aug 12, 2020Updated 5 years ago
- A collection of minimal examples for the sparta plug-ins.☆13Jul 12, 2025Updated 7 months ago
- Dummy repo for testing the doxygen - breathe - readthedocs build process.☆11Jun 17, 2022Updated 3 years ago
- Functions for creating speech features in MATLAB.☆14Jul 7, 2020Updated 5 years ago
- Utilities for negotiating between circles and polygons in SVG☆13Apr 22, 2017Updated 8 years ago
- An example of how to use parser combinators with Express for routing.☆11Nov 8, 2017Updated 8 years ago
- A simple example project in nodejs to demonstrate the compatibility of the AppRTCDemo Android App with the Kurento Media Server.☆10Mar 11, 2016Updated 9 years ago
- 🧲 Magnetism☆13May 7, 2025Updated 10 months ago
- ☆10Apr 7, 2022Updated 3 years ago
- ☆11Aug 19, 2016Updated 9 years ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- The product being developed is a mobile application for android operating system. It is an emotion and pain assessment tool and can be in…☆10Oct 2, 2018Updated 7 years ago
- Official repository for "Survey on AI Memory: Theories, Taxonomies, Evaluations, and Emerging Trends".☆30Jan 22, 2026Updated last month
- Neural Haircut: Prior-Guided Strand-Based Hair Reconstruction. ICCV 2023☆14Mar 8, 2024Updated 2 years ago
- KERL: reinforcement learning algorithms and tools implemented using Keras☆11Aug 2, 2024Updated last year
- Source code of ICML'22 paper: FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting☆10Jun 10, 2022Updated 3 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Jun 2, 2019Updated 6 years ago
- Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging☆11Dec 2, 2021Updated 4 years ago
- MICCAI 2013 code - Segmenting Multiple Overlapping Cervical Cells by Joint Level Set☆12Jun 19, 2013Updated 12 years ago
- Probabilistic Entity Matching in Python☆13Apr 5, 2017Updated 8 years ago
- ☆13Aug 13, 2023Updated 2 years ago
- 爬去虫虫钢琴的曲谱☆11Jul 6, 2018Updated 7 years ago
- Software for Decoding of High Order Ambisonics to Irregular Layouts☆12Mar 20, 2014Updated 11 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Apr 18, 2024Updated last year