linhndt / spoken_language_classification

Predicting the labels (spoken languages) of audio files with audio features (MFCC, RASTA, PLP) using ML-based and statistical approaches (Random Forest, SVM, GMM)

☆10

Alternatives and similar repositories for spoken_language_classification:

Users who are interested in spoken_language_classification are comparing it to the repositories listed below

alibugra / audio-data-augmentation
Audio data augmentation examples
☆34 Updated 6 years ago
mortezaro / ad-recognition-from-speech
☆12 Updated 3 years ago
KrishnaDN / speech-emotion-recognition-using-self-attention
Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…
☆57 Updated 4 years ago
WWH98932 / Audio-Classification-Models
Audio classification is a popular topic; here I implement several models using TensorFlow and Keras.
☆24 Updated 4 years ago
vaibhavsundharam / Speech-Emotion-Analysis
Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…
☆24 Updated 3 years ago
Jungjee / DcaseNet
Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…
☆40 Updated 3 years ago
grausof / keras-sincnet
Keras (TensorFlow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
☆72 Updated 3 years ago
suicao / Pytorch-Audio-Emotion-Recognition
1st Place Public Leaderboard Solution for ERC2019
☆70 Updated 5 years ago
geekysethi / audio_classification
☆21 Updated 5 years ago
dr-costas / dnd-sed
Sound event detection with depthwise separable and dilated convolutions.
☆53 Updated 5 years ago
qiuqiangkong / sound_event_detection_dcase2017_task4
☆53 Updated 4 years ago
KimJeongSun / SpecAugment_numpy_scipy
Fast SpecAugmentation code with NumPy and SciPy
☆30 Updated 5 years ago
bagustris / SER_ICSigSys2019
Repository of code for Speech emotion recognition using voiced speech and attention model, submitted to ICSigSys 2019
☆13 Updated 5 years ago
sainathadapa / dcase2019-task5-urban-sound-tagging
1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging
☆30 Updated 4 years ago
flaviorainhoavila / IEMOCAPspeechEmotionRecognition
Automatic speech emotion recognition based on transfer learning from spectrograms using ResNet
☆21 Updated 3 years ago
juanmc2005 / SpeakerEmbeddingLossComparison
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…
☆59 Updated 4 years ago
shangeth / SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf
☆65 Updated 3 years ago
aishoot / Speech_Feature_Extraction
Feature extraction from the speech signal is the initial stage of any speech recognition system.
☆92 Updated 4 years ago
siddiquelatif / URDU-Dataset
Urdu Language Speech Emotional Corpus
☆45 Updated 6 years ago
KunZhou9646 / controllable_evc_code
Code for the controllable EVC framework for seen and unseen emotion generation.
☆42 Updated 3 years ago
turpaultn / dcase20_task4
Baseline of DCASE 2020 task 4
☆43 Updated 2 years ago
mystlee / rasta_py
RASTA-PLP and MFCC tool based on rasta-mat
☆33 Updated 2 years ago
AmirmohammadRostami / KeywordsSpotting-EfficientNet-A0
EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting
☆23 Updated 2 years ago
soham97 / MTL_Weakly_labelled_audio_data
Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"
☆16 Updated 2 years ago
uzaymacar / simple-speech-features
Simple, straightforward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.
☆22 Updated 5 years ago
musikalkemist / audioDataAugmentationTutorial
Repository hosting the code and slides of the Audio Data Augmentation series on The Sound of AI YouTube channel.
☆37 Updated 3 years ago
EIHW / EmoNet
☆27 Updated 3 years ago
bepierre / SpeechVGG
Feature extractor for DL speech processing.
☆65 Updated 2 years ago
iPRoBe-lab / 1D-Triplet-CNN
PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Spea…
☆27 Updated 5 years ago
qiuqiangkong / dcase2018_task5
☆9 Updated 6 years ago