oscarknagg / voicemapLinks
Identifying people from small audio fragments
β170Updated 5 years ago
Alternatives and similar repositories for voicemap
Users that are interested in voicemap are comparing it to the libraries listed below
Sorting:
- Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features libraryβ212Updated 5 years ago
- π§ Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)β223Updated 5 years ago
- A neural attention model for speech command recognitionβ186Updated 5 months ago
- π£οΈ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).β385Updated 3 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Pythonβ181Updated 4 years ago
- Speaker diarization scripts, based on AaltoASRβ191Updated 6 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β373Updated 6 months ago
- β90Updated 3 years ago
- A fully convolution-network for speech-to-text, built on pytorch.β126Updated 5 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ232Updated 4 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorchβ212Updated 5 years ago
- β84Updated 5 years ago
- [deprecated] Pretrained models for pyannote-audio 1.xβ71Updated 3 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banksβ170Updated last year
- Tensorflow 2.0 implementation of the paper: A Fully Convolutional Neural Network for Speech Enhancementβ256Updated 5 years ago
- β© Generating speech in a single forward pass without any attention!β580Updated 2 weeks ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networksβ452Updated 5 years ago
- Text to Speech with PyTorch (English and Mongolian)β186Updated last year
- Detecting emotions using MFCC features of human speech using Deep Learningβ131Updated 5 years ago
- Speaker diarization python system based on binary key speaker modellingβ60Updated 3 years ago
- A simple audio feature extraction libraryβ81Updated 6 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β539Updated 3 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)β218Updated 2 years ago
- Problem Agnostic Speech Encoderβ444Updated 2 years ago
- It uses GMM to train a speaker identification model. The training and testing has been done on subset (34 speakers) from VoxForge data coβ¦β58Updated 6 years ago
- This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodeβ¦β190Updated 8 years ago
- Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"β370Updated 4 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)β74Updated 4 years ago
- βοΈβοΈ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).β90Updated last year
- Understanding emotions from audio files using neural networks and multiple datasets.β425Updated 2 years ago