Speech-VINO / Smart-Media-Player
For our Smart Media Player project (detecting the time periods in audio/video during which specific persons are speaking)
☆18Updated 5 years ago
Alternatives and similar repositories for Smart-Media-Player
Users who are interested in Smart-Media-Player are comparing it to the libraries listed below.
- A neural attention model for speech command recognition☆187Updated 2 months ago
- How to do Real Time Trigger Word Detection with Keras | DLology☆161Updated 6 years ago
- Using speaker embedding for diarization in PyTorch☆17Updated 5 years ago
- A fully convolutional network for speech-to-text, built on PyTorch.☆126Updated 5 years ago
- Easy-to-use Connectionist Temporal Classification in Keras☆77Updated 4 years ago
- Speaker Diarization using GRU in PyTorch☆11Updated 5 years ago
- Udacity 2018 Machine Learning Nanodegree Capstone project☆146Updated 6 years ago
- ☆46Updated 7 years ago
- End-to-End Speech Recognition Using Tensorflow☆41Updated 2 years ago
- https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/☆21Updated 7 years ago
- ☆84Updated 5 years ago
- Identifying people from small audio fragments☆170Updated 5 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- ☆90Updated 2 years ago
- A Keras CTC implementation of Baidu's DeepSpeech for model experimentation☆242Updated 7 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Updated 6 years ago
- Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks☆105Updated 7 years ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks☆48Updated 4 years ago
- Audio data augmentation examples☆34Updated 7 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆211Updated 5 years ago
- DeepSpeech, Speech To Text, ASR, Speech recognition, Keras, Tensorflow☆30Updated 7 years ago
- Speaker Identification System (up to 100% accuracy); built using Python 2.7 and python_speech_features library☆211Updated 5 years ago
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)☆225Updated 5 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Python☆180Updated 3 years ago
- 8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)☆115Updated 4 years ago
- Tensorflow 2.0 implementation of the paper: A Fully Convolutional Neural Network for Speech Enhancement☆257Updated 4 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆90Updated 3 years ago
- Machine learning experiment to perform gender classification from raw audio.☆23Updated 7 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated 2 years ago
- This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanode…☆190Updated 8 years ago