bioidiap / bob
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
☆48Updated last year
Alternatives and similar repositories for bob:
Users that are interested in bob are comparing it to the libraries listed below
- ☆64Updated 6 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- Python library for audio augmentation☆84Updated last year
- Convolutional neural networks for sound classification☆20Updated 7 years ago
- ☆15Updated 7 years ago
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…☆16Updated 9 years ago
- Sound event detection in real life audio with CNN submitted to DCASE16☆22Updated 2 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- Utils and data sets for audio and PyTorch☆85Updated 3 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- Weakly Supervised CRNN System for Sound Event Detection With Large-scale Unlabeled In-domain Data☆10Updated 6 years ago
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆34Updated 7 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆85Updated 4 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆71Updated 5 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- VoxCeleb plugin for pyannote.database☆29Updated 3 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 7 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆65Updated 4 years ago
- Speaker recognition ,Voiceprint recognition☆53Updated 5 years ago
- End to End Multiview Lip Reading☆10Updated 7 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow.☆40Updated 6 years ago
- SiSEC MUS 2018 Submission System☆43Updated 5 years ago
- Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"☆101Updated 5 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 5 years ago
- ☆27Updated 6 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 5 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 6 years ago
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models"☆11Updated 6 years ago