bioidiap / bobLinks
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
☆48Updated 2 years ago
Alternatives and similar repositories for bob
Users that are interested in bob are comparing it to the libraries listed below
Sorting:
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆50Updated 9 years ago
- Urban Sound Classification: With Random Forest, SVM, DNN, RNN, and CNN Classifiers☆55Updated 8 years ago
- A program for automatic speaker identification using deep learning techniques.☆84Updated 8 years ago
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆75Updated 4 years ago
- Audio Classification using Image Classification☆48Updated 6 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆102Updated 4 months ago
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 7 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆40Updated 7 years ago
- HTK features in Python☆73Updated 2 months ago
- How to run GPU accelerated Signal Processing in TensorFlow☆23Updated 7 years ago
- Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow.☆40Updated 7 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 7 years ago
- Python library for audio augmentation☆85Updated 2 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆21Updated 9 years ago
- ☆27Updated 6 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 6 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 6 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Updated 5 years ago
- Utils and data sets for audio and PyTorch☆86Updated 4 years ago
- ☆15Updated 8 years ago
- Convolutional neural networks for sound classification☆20Updated 8 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆85Updated 5 years ago
- Classification of environmental sounds using first order statistics and GLCM (Gray-Level Co-Occurrence Matrix ) features of a spectrogram…☆25Updated 5 years ago
- Audio data augmentation examples☆34Updated 7 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Updated 3 years ago
- A PyTorch implementation of DeepSpeech and DeepSpeech2.☆50Updated 7 years ago
- An end-to-end MATLAB toolkit for completely unsupervised Speaker Diarization using state-of-the-art algorithms.☆15Updated 10 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo…☆70Updated 8 years ago
- Recurrent neural network training for noise reduction in robust automatic speech recognition☆147Updated 11 years ago