Speech-VINO / Smart-Media-PlayerLinks
For our Smart Media Player (detecting time period(s) inside audio/video during which specific person(s) is/are speaking) project
☆18Updated 5 years ago
Alternatives and similar repositories for Smart-Media-Player
Users that are interested in Smart-Media-Player are comparing it to the libraries listed below
Sorting:
- Using speaker embedding for diarization in PyTorch☆17Updated 5 years ago
- A fully convolution-network for speech-to-text, built on pytorch.☆126Updated 5 years ago
- A neural attention model for speech command recognition☆186Updated 5 months ago
- How to do Real Time Trigger Word Detection with Keras | DLology☆161Updated 6 years ago
- Easy-to-use Connectionnist Temporal Classification in Keras☆77Updated 4 years ago
- ☆90Updated 3 years ago
- End-to-End Speech Recognition Using Tensorflow☆41Updated 2 years ago
- DeepSpeech, Speech To Text, ASR, Speech recognition, Keras, Tensorflow☆30Updated 7 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- tf 2.0 implementation of Listen, attend and spell☆21Updated 4 years ago
- Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks☆105Updated 7 years ago
- Speaker Diarization using GRU in PyTorch☆11Updated 5 years ago
- The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project descript…☆28Updated 6 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆91Updated 3 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Updated 2 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 6 years ago
- Identifying people from small audio fragments☆170Updated 5 years ago
- Audio data augmentation examples☆34Updated 7 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 3 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Updated 6 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Updated 4 years ago
- Speech recognition with CTC in Keras with Tensorflow backend☆31Updated 2 years ago
- SpeechYOLO Interspeech 2019☆46Updated 3 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆176Updated last year
- ☆84Updated 5 years ago
- Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline.☆49Updated 6 years ago
- ☆38Updated 5 years ago
- Detecting emotion in voices☆47Updated 6 years ago
- PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning☆230Updated 4 years ago
- For our speech emotion recognition project☆28Updated 4 years ago