bioidiap / bobLinks
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
☆48Updated 2 years ago
Alternatives and similar repositories for bob
Users that are interested in bob are comparing it to the libraries listed below
Sorting:
- A program for automatic speaker identification using deep learning techniques.☆84Updated 8 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆101Updated 2 years ago
- Urban Sound Classification: With Random Forest, SVM, DNN, RNN, and CNN Classifiers☆54Updated 8 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo…☆69Updated 7 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆49Updated 8 years ago
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆74Updated 4 years ago
- Utils and data sets for audio and PyTorch☆86Updated 3 years ago
- Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow.☆40Updated 6 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data☆80Updated 7 years ago
- Audio data augmentation examples☆34Updated 7 years ago
- audio classification using TensorFlow☆15Updated 8 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Updated 2 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- How to run GPU accelerated Signal Processing in TensorFlow☆23Updated 6 years ago
- HTK features in Python☆73Updated 6 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46Updated 8 years ago
- ☆65Updated 6 years ago
- https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques …☆26Updated 8 years ago
- ☆26Updated 6 years ago
- Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman.☆51Updated 6 years ago
- Audio Classification using Image Classification☆48Updated 5 years ago
- Speech recognition on the TIMIT (or any other) dataset☆43Updated 7 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 7 years ago
- Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks☆64Updated 6 years ago
- ☆27Updated 6 years ago
- End to End Dialect Identification using Convolutional Neural Network☆52Updated 5 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆39Updated 7 years ago
- Python library for audio augmentation☆84Updated 2 years ago
- Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"☆103Updated 5 years ago
- It uses GMM to train a gender detector model. The testing has been done on subset of Google's AudioSet corpus.☆19Updated 8 years ago