bioidiap / bob
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
☆48Updated last year
Alternatives and similar repositories for bob
Users who are interested in bob are comparing it to the libraries listed below.
- This code implements a basic MLP for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- Python library for audio augmentation☆84Updated last year
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- ☆64Updated 6 years ago
- Estimate the number of concurrent speakers from single-channel mixtures to crack the "cocktail-party" problem.☆22Updated 5 years ago
- Audio data augmentation examples☆34Updated 7 years ago
- End to End Dialect Identification using Convolutional Neural Network☆52Updated 5 years ago
- A program for automatic speaker identification using deep learning techniques.☆84Updated 8 years ago
- Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"☆102Updated 5 years ago
- Convolutional neural networks for sound classification☆20Updated 7 years ago
- ☆15Updated 7 years ago
- Weakly Supervised CRNN System for Sound Event Detection With Large-scale Unlabeled In-domain Data☆10Updated 6 years ago
- PyTorch-based phoneme recognition (TIMIT phoneme classification)☆34Updated 7 years ago
- Sound event detection in real life audio with CNN submitted to DCASE16☆22Updated 2 years ago
- 24-hour Automatic Speech Recognition☆27Updated 4 years ago
- How to run GPU-accelerated signal processing in TensorFlow☆23Updated 6 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Updated 2 years ago
- Processing and extraction of face and mouth image files from the TCDTIMIT database☆45Updated 4 years ago
- Run speaker recognition algorithms - Mirrored from https://gitlab.idiap.ch/bob/bob.bio.spear☆19Updated last year
- TensorFlow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆74Updated 4 years ago
- Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman.☆51Updated 6 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46Updated 8 years ago
- The Additive Margin MobileNet1D is a new lightweight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- Read and write HTK and HTS files from Python.☆20Updated 10 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆108Updated last year
- Speaker Diarization is the problem of separating speakers in an audio recording. There could be any number of speakers and the final result should stat…☆65Updated 4 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 6 years ago
- Utils and data sets for audio and PyTorch☆85Updated 3 years ago
- An end-to-end MATLAB toolkit for completely unsupervised Speaker Diarization using state-of-the-art algorithms.☆16Updated 9 years ago
- Work inspired by the Microsoft Research project on SER using ELM☆19Updated 6 years ago