Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.
☆73Jun 7, 2021Updated 5 years ago
Alternatives and similar repositories for Multimodal-action-recognition
Users that are interested in Multimodal-action-recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Central repository for all public AIDA resources☆13Mar 1, 2021Updated 5 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆31Apr 13, 2020Updated 6 years ago
- Pytorch implementation of DSR-RL for Video Summarization Task☆12Aug 30, 2021Updated 4 years ago
- MMAct Challenge☆13Jun 20, 2021Updated 5 years ago
- collection of skeleton-based human action recognition☆10Jun 28, 2020Updated 6 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- (2020) Video Classification Neural Network☆30Feb 18, 2020Updated 6 years ago
- Multimodal datasets.☆34Jan 26, 2024Updated 2 years ago
- A collection of multimodal datasets, and visual features for VQA and captionning in pytorch. Just run "pip install multimodal"☆83Feb 25, 2022Updated 4 years ago
- Implementation of "Mutimodal Convolution Neural Networks for Matching Image and Sentence"☆12Oct 25, 2015Updated 10 years ago
- A search engine implementation using OpenAI's clip model☆10Jun 20, 2021Updated 5 years ago
- Analyze the open source human action regonition data from UT Dallas using Python☆11Nov 21, 2018Updated 7 years ago
- Pretraining summarization models using a corpus of nonsense☆13Sep 28, 2021Updated 4 years ago
- Tensorflow implementation of "Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-supervised Object and Action Localization"[ICC…☆13Mar 29, 2019Updated 7 years ago
- Annotated dataset of quadrotor Eagle for object detection of UAVs☆15Apr 4, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as mul…☆918Mar 15, 2023Updated 3 years ago
- Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"☆18Jun 21, 2023Updated 3 years ago
- Transformer for Action Recognition in PyTorch☆39Mar 14, 2020Updated 6 years ago
- k-t CLAIR: Self-Consistency Guided Multi-Prior Learning for Dynamic Parallel MR Image Reconstruction☆11Jan 30, 2025Updated last year
- video summarization lstm-gan pytorch implementation☆27Dec 6, 2019Updated 6 years ago
- Zicx's Notebook.☆11Nov 7, 2025Updated 7 months ago
- The official code of paper "Multi-to-Single: Reducing Multimodal Dependency in Emotion Recognition through Contrastive Learning" (AAAI 20…☆33Sep 30, 2025Updated 9 months ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- Awesome GNN Learning For beginners☆16Oct 18, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- (Competition) 6th -- Scene-Text-Detection-and-Recognition.☆11Jun 14, 2022Updated 4 years ago
- Official pytorch implementation of I2I translation with low resolution conditioning☆23Sep 2, 2021Updated 4 years ago
- Official repository of "TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection"☆11May 25, 2025Updated last year
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition☆34Nov 29, 2024Updated last year
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆21Mar 24, 2023Updated 3 years ago
- ☆19Oct 30, 2023Updated 2 years ago
- A multimodal UAV assistant dataset.☆11Jun 14, 2021Updated 5 years ago
- Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"☆54Nov 7, 2024Updated last year
- Autonomous Exploration of mobile robots in unknown environments using Deep Reinforcement learning☆14Oct 28, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"☆365Jul 25, 2024Updated last year
- Code associated with our paper "Learning Group Structure and Disentangled Representations of Dynamical Environments"☆15Dec 8, 2022Updated 3 years ago
- Attacks against proposed image encryption schemes☆10Apr 27, 2020Updated 6 years ago
- Mobile exploration robot with the ability to release an auxiliary drone to increase its sensing and operational capabilities.☆11Mar 16, 2024Updated 2 years ago
- Study into Pointnet and Pointnet++ for possible enhancement☆12Nov 13, 2018Updated 7 years ago
- ☆19Jul 27, 2021Updated 4 years ago
- GPU Accelerated Euclidean Distance Transform☆18May 29, 2024Updated 2 years ago