bioidiap / bobLinks
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
☆48Updated 2 years ago
Alternatives and similar repositories for bob
Users that are interested in bob are comparing it to the libraries listed below
Sorting:
- A program for automatic speaker identification using deep learning techniques.☆84Updated 8 years ago
- How to run GPU accelerated Signal Processing in TensorFlow☆23Updated 6 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆50Updated 8 years ago
- Audio Classification using Image Classification☆48Updated 5 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo…☆69Updated 7 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆102Updated 3 weeks ago
- End to End Dialect Identification using Convolutional Neural Network☆52Updated 5 years ago
- An end-to-end MATLAB toolkit for completely unsupervised Speaker Diarization using state-of-the-art algorithms.☆15Updated 9 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Updated 2 years ago
- Urban Sound Classification: With Random Forest, SVM, DNN, RNN, and CNN Classifiers☆54Updated 8 years ago
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆74Updated 4 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Updated 4 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46Updated 8 years ago
- audio cfeatures extraction tool from wav to h5features format☆19Updated 6 years ago
- HTK features in Python☆73Updated 6 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Updated 6 years ago
- Convolutional neural networks for sound classification☆20Updated 7 years ago
- Python library for audio augmentation☆84Updated 2 years ago
- ☆15Updated 7 years ago
- Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman.☆51Updated 6 years ago
- 24-hour Automatic Speech Recognition☆27Updated 4 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 6 years ago
- Time Delayed NN implemented in pytorch☆81Updated 8 years ago
- ☆30Updated 6 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆85Updated 5 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆73Updated 6 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Updated 8 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆21Updated 8 years ago
- Masked ConditionaL Neural Networks☆15Updated 2 years ago