acen20 / cnn-tf-keras-audio-classification
Feature extraction from sound signals, along with a complete CNN model and evaluations, using TensorFlow, Keras, and librosa for MFCC generation
☆10Updated 3 years ago
Alternatives and similar repositories for cnn-tf-keras-audio-classification
Users that are interested in cnn-tf-keras-audio-classification are comparing it to the libraries listed below
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- ☆84Updated 2 years ago
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆37Updated 3 years ago
- ☆21Updated 5 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Updated last year
- A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.☆27Updated 3 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆84Updated last year
- General purpose sound recognition demo☆157Updated last year
- ☆93Updated 2 years ago
- This repository contains the code for a SOTA model on the Google Speech Command V2 dataset.☆15Updated last year
- Audio classification is a popular topic; here I implement several models using TensorFlow and Keras.☆24Updated 4 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆67Updated 4 years ago
- Repo associated with the DESED dataset: download and creation of data☆138Updated 11 months ago
- Learning discriminative and robust time-frequency representations for environmental sound classification: Convolutional neural networks (…☆30Updated 5 years ago
- PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate the audioset category ontology.☆79Updated 2 months ago
- Simple real-time Sound Event Detector based on YAMNet and pyaudio.☆23Updated 5 years ago
- Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家☆44Updated last year
- Sound Classification using Librosa, ffmpeg, CNN, Keras, XGBOOST, Random Forest.☆69Updated last year
- A two-stage polyphonic sound event detection and localization method for both SED and DOA.☆114Updated 2 years ago
- ☆65Updated 9 months ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆64Updated 9 months ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆64Updated 2 years ago
- Baseline of DCASE 2020 task 4☆43Updated 2 years ago
- ☆107Updated 4 years ago
- A curated list of awesome voice activity detection☆57Updated 7 months ago
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances☆50Updated 2 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆91Updated 3 years ago
- ☆54Updated 5 years ago
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEE…☆190Updated last year