zdmc23 / oneshot-audio
Experiment with "one-shot learning" techniques to recognize a voice signature
☆24 Updated 5 years ago
Alternatives and similar repositories for oneshot-audio
Users that are interested in oneshot-audio are comparing it to the libraries listed below
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications. ☆49 Updated 8 years ago
- How to run GPU accelerated Signal Processing in TensorFlow ☆23 Updated 6 years ago
- Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks ☆64 Updated 6 years ago
- Audio Classification using Image Classification ☆48 Updated 5 years ago
- Automatic Speech Recognition Dataset Generation ☆37 Updated 6 years ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks ☆48 Updated 3 years ago
- Audio data augmentation examples ☆34 Updated 7 years ago
- COVID-19 Coughs files for training AI models ☆41 Updated 4 years ago
- ☆83 Updated 5 years ago
- keras project for audio deep learning ☆40 Updated 7 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection? ☆33 Updated 7 years ago
- Machine Learning Sound Classifier ☆136 Updated 5 years ago
- Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline. ☆49 Updated 6 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning ☆101 Updated 2 years ago
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf ☆74 Updated 4 years ago
- End-to-End Speech Recognition using Neural Networks. ☆35 Updated 10 months ago
- Companion repository for the blog article: https://www.endpointdev.com/blog/2019/01/speech-recognition-with-tensorflow/ ☆22 Updated 3 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning ☆44 Updated 4 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo… ☆69 Updated 7 years ago
- https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques … ☆26 Updated 8 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes). ☆30 Updated last year
- Live demo for speech emotion recognition using Keras and Tensorflow models ☆39 Updated 11 months ago
- 8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2) ☆115 Updated 4 years ago
- ☆29 Updated 6 years ago
- It uses GMM to train a gender detector model. The testing has been done on subset of Google's AudioSet corpus. ☆19 Updated 8 years ago
- Speaker Diarization is the first step in many early audio processing and aims to solve the problem “who spoke when”. It therefore relies … ☆12 Updated 6 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox ☆13 Updated 7 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec… ☆38 Updated 7 years ago
- Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow. ☆40 Updated 6 years ago
- Phoneme prediction from speech mel-spectrograms using RNN. ☆14 Updated 6 years ago