Implementation of the BReG-NeXt architecture
☆22Mar 24, 2023Updated 2 years ago
Alternatives and similar repositories for BReG-NeXt
Users that are interested in BReG-NeXt are comparing it to the libraries listed below
Sorting:
- This is the code repository of model TDGAN. Paper: Facial Expression Recognition with Two-branch Disentangled Generative Adversarial Netw…☆24Dec 23, 2021Updated 4 years ago
- cross-modal model between audio(MFCC) and text(KoBERT)☆12Jan 14, 2021Updated 5 years ago
- DeepCU: Integrating Both Common and Unique Latent Information for Multimodal Sentiment Analysis, IJCAI-19☆19Nov 21, 2019Updated 6 years ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆25Sep 4, 2020Updated 5 years ago
- A Unified Evaluation Benchmark for Cross-Domain Facial Expression Recognition (TPAMI'22, ACM MM'20)☆109Apr 9, 2024Updated last year
- ☆28Dec 22, 2021Updated 4 years ago
- MIMAMO Net: Integrating Micro- and Macro-motion for Video Emotion Recognition☆62Dec 19, 2020Updated 5 years ago
- ☆28Jul 9, 2021Updated 4 years ago
- Implementation of the paper "Real-Time Emotion Recognition via Attention Gated Hierarchical Memory Network" in AAAI-2020.☆31Sep 2, 2022Updated 3 years ago
- Efficient Facial Feature Learning with Wide Ensemble-based Convolutional Neural Networks☆195Nov 22, 2022Updated 3 years ago
- Multimodal Transformer for Korean Sentiment Analysis with Audio and Text Features☆28Sep 7, 2021Updated 4 years ago
- TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20☆33Aug 10, 2020Updated 5 years ago
- Pytorch code for our TOMM2022 paper "A Multimodal framework for large scale Emotion Recognition by Fusing Music and Electrodermal Activit…☆36Mar 15, 2022Updated 3 years ago
- ☆30Sep 19, 2021Updated 4 years ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆38Apr 20, 2021Updated 4 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- Attempt to enable wireless CarPlay on Proton X50 via qdlink.☆10Apr 14, 2021Updated 4 years ago
- Frequency tracking in time-frequency representations☆13Jan 19, 2021Updated 5 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 4 years ago
- This is a short tutorial for using the CMU-MultimodalSDK.☆87Mar 20, 2019Updated 6 years ago
- Patch Attentive Deep Network for Action Unit Detection☆38Oct 9, 2021Updated 4 years ago
- Facial Expression Recognition using Inception V3 Model in keras☆11Nov 27, 2017Updated 8 years ago
- Time frequency ridge detection based on relevant ridge portions☆11Aug 17, 2023Updated 2 years ago
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions☆20Jul 29, 2025Updated 7 months ago
- A fine multimodality fusion network :)☆11Aug 9, 2021Updated 4 years ago
- Reference implementation and test synthetic data for Sorted Center Time echo density measure for acoustic impulse responses☆15Mar 18, 2020Updated 5 years ago
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆11Oct 9, 2024Updated last year
- Use CenterLoss , IslandLoss at solve the Facial Expression Recognition task. (Use FER2013 Dataset)☆36Mar 9, 2019Updated 6 years ago
- ☆10Nov 10, 2021Updated 4 years ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated last month
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆13Sep 6, 2024Updated last year
- Implementation of STN (Spatial Transformer Network) and ICSTN (Inverse Compositional Spatial Transformer Networks) in Tensorlayer to pred…☆16Dec 6, 2021Updated 4 years ago
- This is our collected datasets for challenge condition facial expression recognition☆250May 25, 2020Updated 5 years ago
- ☆11Nov 5, 2025Updated 4 months ago
- 2023 Spring SNU Computer Vision Project☆14Jun 13, 2023Updated 2 years ago