acen20 / cnn-tf-keras-audio-classification
Feature extraction from sound signals, along with a complete CNN model and evaluations, using TensorFlow, Keras, and librosa for MFCC generation
☆10Updated 3 years ago
Alternatives and similar repositories for cnn-tf-keras-audio-classification
Users who are interested in cnn-tf-keras-audio-classification are comparing it to the repositories listed below.
- Estimating the age, height, and gender of a speaker from their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆65Updated 4 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆91Updated 3 years ago
- Multi-class audio classification with MFCC features using CNN☆30Updated 5 years ago
- This project performs speaker diarization for the Hindi language.☆49Updated 4 years ago
- ☆21Updated 5 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆67Updated 4 years ago
- Simple real-time Sound Event Detector based on YAMNet and pyaudio.☆23Updated 5 years ago
- This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.☆99Updated 2 years ago
- ☆92Updated 2 years ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆45Updated 4 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆63Updated 2 years ago
- A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.☆27Updated 3 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆211Updated 4 years ago
- ☆31Updated this week
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆132Updated 2 months ago
- Learning discriminative and robust time-frequency representations for environmental sound classification: Convolutional neural networks (…☆29Updated 5 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆100Updated 3 months ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆73Updated 4 years ago
- Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.☆15Updated 2 years ago
- General purpose sound recognition demo☆156Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆83Updated last year
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆78Updated 4 years ago
- An implementation of Wave-U-Net in PyTorch, adapted to speech enhancement.☆330Updated 2 years ago
- A two-stage polyphonic sound event detection and localization method for both SED and DOA.☆114Updated 2 years ago
- A Python library for speech enhancement☆79Updated 10 months ago
- ☆49Updated last year
- Speech Separation☆64Updated last year
- Analyzes a signal and finds its fundamental frequency, HNR, etc.☆15Updated 7 years ago
- ☆84Updated last year
- A self-supervised speech denoising strategy named Only-Noisy Training (ONT), which solves the speech denoising problem with only noisy au…☆68Updated 2 years ago