alantanlc / torchemotion
Emotion recognition library for PyTorch
☆22Updated 4 years ago
Alternatives and similar repositories for torchemotion
Users that are interested in torchemotion are comparing it to the libraries listed below
Sorting:
- Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"☆19Updated last year
- ☆21Updated 4 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Updated 2 years ago
- Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS☆10Updated last year
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 5 months ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- COLA contrastive pre-training method implemented in PyTorch☆43Updated 4 years ago
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆74Updated 4 years ago
- A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'☆41Updated 6 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆90Updated 3 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆44Updated 3 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆63Updated 7 months ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 6 years ago
- Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch☆43Updated 4 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆41Updated 3 years ago
- the implementation of chunk-level attention-based temporal aggregation framework for sequence-to-one recognition tasks☆9Updated last year
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆45Updated 4 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 4 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆66Updated 2 years ago
- Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"☆16Updated 2 years ago
- ☆131Updated 8 months ago
- The official repository for Audio ALBERT☆65Updated 3 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆141Updated 2 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆17Updated 3 years ago
- ☆53Updated 4 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆107Updated last year
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆34Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆44Updated last year