bbjornstad / audio-feature-extraction
A repository holding my personal implementations of audio feature extraction for environmental and musical auditory analysis and classification.
☆10Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for audio-feature-extraction
- ☆23Updated 5 years ago
- https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques …☆26Updated 7 years ago
- Audio classification via transfer learning☆32Updated 5 years ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, …☆74Updated 4 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆20Updated 2 years ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks☆49Updated 3 years ago
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆38Updated 2 years ago
- Urban Sound Classification : striving towards a fair comparison☆17Updated 3 years ago
- A real-time analyzer to detect normal speech/abusive speech/noise☆8Updated 6 years ago
- Classification of WAV files from cats and dogs☆22Updated 6 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆23Updated 4 years ago
- A blind source separation package using non-negative matrix factorization and non-negative ICA☆14Updated 3 years ago
- music genre classification : LSTM vs Transformer☆60Updated last year
- End-to-End Speech Recognition using Neural Networks.☆35Updated 2 months ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- Toolkit to asses speech impairments in patients with neurological disorders☆51Updated 6 years ago
- speech recognition using Kaldi framework☆12Updated 4 years ago
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging☆30Updated 3 years ago
- Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception)☆58Updated last year
- Bird's tweet classification with Deep Learning☆9Updated 6 years ago
- Features from audio: Spectrogram, (Wavelet Transform) Scalogram, (Q Transform) Spectrogram☆17Updated 6 years ago
- CNN 1D vs 2D audio classification☆104Updated 5 years ago
- ☆21Updated 5 years ago
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆20Updated 5 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆39Updated last year
- An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras☆42Updated 2 years ago
- Audio data augmentation examples☆35Updated 6 years ago
- Classifying 10 different categories of Sound using Deep Learning.☆25Updated 6 years ago
- speaker_diarization done on toy dataset and tested on timit dataset☆8Updated 2 years ago