beasteers / VGGishLinks
An inplementation of vggish in keras with tf backend
☆11Updated 3 years ago
Alternatives and similar repositories for VGGish
Users that are interested in VGGish are comparing it to the libraries listed below
Sorting:
- Urban Sound Classification : striving towards a fair comparison☆17Updated 5 years ago
- Freesound Audio Tagging 2019☆95Updated 6 years ago
- An implementation of vggish in keras with tf backend☆122Updated 4 years ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks☆48Updated 4 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- 6th place solution to Freesound Audio Tagging 2019 kaggle competition☆25Updated 5 years ago
- PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate autioset category ontology.☆99Updated 9 months ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆91Updated 4 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated 2 years ago
- 1st Place solution to the Cornell Birdcall Identification competition.☆156Updated 5 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆20Updated 3 years ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn☆145Updated 3 years ago
- small experimentation about positional encoding☆19Updated 5 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆99Updated 6 years ago
- Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks☆64Updated 6 years ago
- Easy-to-use Connectionnist Temporal Classification in Keras☆77Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 3 years ago
- Kaggle | 1st place solution for Freesound Audio Tagging 2019☆316Updated 3 years ago
- [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.☆138Updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Updated 2 years ago
- Collection of research papers on cough classification☆40Updated 5 years ago
- An audio classification system for learning with out-of-distribution data☆33Updated 3 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆102Updated 4 months ago
- ☆19Updated 6 years ago
- PyTorch end-to-end speech recognition☆49Updated 5 years ago
- Machine Learning Sound Classifier☆137Updated 6 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- 8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)☆115Updated 5 years ago