linhndt / spoken_language_classification
Predicting the spoken language of audio files from audio features (MFCC, RASTA, PLP) using machine-learning and statistical approaches (Random Forest, SVM, GMM)
☆10 · Updated 5 years ago
Alternatives and similar repositories for spoken_language_classification:
Users who are interested in spoken_language_classification are comparing it to the repositories listed below.
- Audio data augmentation examples ☆34 · Updated 6 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I… ☆57 · Updated 4 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP… ☆59 · Updated 4 years ago
- Baseline of DCASE 2020 task 4 ☆43 · Updated 2 years ago
- Audio classification is a popular topic; here I implement several models using TensorFlow and Keras. ☆24 · Updated 4 years ago
- Keras (TensorFlow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet) ☆72 · Updated 3 years ago
- Multi-class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id… ☆66 · Updated 4 years ago
- Fast SpecAugmentation code with NumPy and SciPy ☆30 · Updated 5 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a… ☆39 · Updated 3 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system. ☆92 · Updated 4 years ago
- 1st Place Public Leaderboard Solution for ERC2019 ☆70 · Updated 5 years ago
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging ☆30 · Updated 3 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u… ☆24 · Updated 3 years ago
- Neural network based similarity scoring for diarization (PyTorch implementation of "LSTM based Similarity Measurement with Spectral Clust… ☆44 · Updated 4 years ago
- Lightweight neural speaker embedding extraction based on Kaldi and PyTorch. ☆136 · Updated 5 years ago
- Repository of code for Speech emotion recognition using voiced speech and attention model, submitted to ICSigSys 2019 ☆13 · Updated 5 years ago
- Sound event detection with depthwise separable and dilated convolutions. ☆53 · Updated 4 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify languages using MFCC features ☆25 · Updated 6 months ago
- An Audio-Visual Voice Activity Detection using Deep Learning ☆48 · Updated 5 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course. ☆10 · Updated 2 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020 ☆22 · Updated 4 years ago
- Audio classification via transfer learning ☆33 · Updated 5 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes). ☆100 · Updated last year
- Multi-class audio classification with MFCC features using CNN ☆28 · Updated 5 years ago
- Spectra extraction tutorials based on torch and torchaudio. ☆41 · Updated last year
- DCASE 2020 Task 2 - Unsupervised Detection of Anomalous Sounds for Machine Condition Monitoring ☆53 · Updated 4 years ago