linhndt / spoken_language_classification
Predicting the labels (spoken languages) of audio files with audio features (MFCC, RASTA, PLP) using ML-based and statistical approaches (Random Forest, SVM, GMM)
☆10Updated 4 years ago
Related projects: ⓘ
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 3 years ago
- Repository of code for Speech emotion recognition using voiced speech and attention model, submitted to ICSigSys 2019☆13Updated 4 years ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆43Updated 3 years ago
- 1st Place Public Leaderboard Solution for ERC2019☆69Updated 4 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…☆22Updated 3 years ago
- Classify the emotions from variable-length speech segments☆11Updated 6 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆72Updated 3 years ago
- ☆52Updated 4 years ago
- ☆51Updated 6 years ago
- Audio classification via transfer learning☆32Updated 4 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆16Updated last month
- ☆21Updated 4 years ago
- Audio data augmentation examples☆35Updated 6 years ago
- Implementation of the multi-time-scale convolution layer used in the paper Multi-Time-Scale Convolution for Emotion Recognition from Spee…☆11Updated 4 years ago
- ☆26Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 2 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 3 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆35Updated 4 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆88Updated 3 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆41Updated 2 years ago
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆20Updated 4 years ago
- Implementation of IEEE Access paper - Lung Sound Recognition Algorithm Based on VGGish-BiGRU☆26Updated 4 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system.☆91Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆57Updated 3 years ago
- TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20☆32Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆24Updated last year
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆22Updated 3 years ago
- [ICASSP19] An Interaction-aware Attention Network for Speech Emotion Recognition in Spoken Dialogs☆35Updated 4 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆97Updated last year
- ☆26Updated 2 years ago