linhndt / spoken_language_classification
Predicting the labels (spoken languages) of audio files with audio features (MFCC, RASTA, PLP) using ML-based and statistical approaches (Random Forest, SVM, GMM)
☆10Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for spoken_language_classification
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆58Updated 3 years ago
- Audio data augmentation examples☆35Updated 6 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system.☆91Updated 4 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…☆24Updated 3 years ago
- ☆21Updated 4 years ago
- Baseline of DCASE 2020 task 4☆42Updated 2 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆72Updated 3 years ago
- 1st Place Public Leaderboard Solution for ERC2019☆69Updated 4 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 3 years ago
- TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20☆32Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- ☆53Updated 4 years ago
- ☆15Updated 4 years ago
- Audio classification via transfer learning☆32Updated 5 years ago
- Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition☆66Updated 2 years ago
- [ICASSP19] An Interaction-aware Attention Network for Speech Emotion Recognition in Spoken Dialogs☆35Updated 4 years ago
- Automatic speech emotion recognition based on transfer learning from spectrograms using ResNET☆21Updated 2 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆60Updated last month
- Time series course Fall 2019 project☆53Updated 4 years ago
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆43Updated 2 years ago
- ☆17Updated 2 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆77Updated 3 years ago
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.☆35Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆24Updated 2 years ago
- Implementation of the multi-time-scale convolution layer used in the paper Multi-Time-Scale Convolution for Emotion Recognition from Spee…☆11Updated 5 years ago
- Implementation of IEEE Access paper - Lung Sound Recognition Algorithm Based on VGGish-BiGRU☆26Updated 4 years ago
- ☆53Updated 6 years ago