Speech-VINO / Smart-Media-Player
For our Smart Media Player project: detecting the time period(s) in audio/video during which specific person(s) are speaking
☆18 · Updated 5 years ago
Alternatives and similar repositories for Smart-Media-Player:
Users who are interested in Smart-Media-Player are comparing it to the libraries listed below.
- Using speaker embedding for diarization in PyTorch ☆18 · Updated 4 years ago
- Speaker Diarization using GRU in PyTorch ☆11 · Updated 4 years ago
- A fully convolutional network for speech-to-text, built on PyTorch. ☆126 · Updated 4 years ago
- The repository contains all the code necessary for my project - Automatic Speech Recognition System in Hindi Language (Project descript… ☆28 · Updated 5 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆72 · Updated 2 years ago
- End-to-End Speech Recognition Using TensorFlow ☆42 · Updated 2 years ago
- Data preparation code for building Kaldi ASR system ☆14 · Updated 8 years ago
- 6th place solution to Freesound Audio Tagging 2019 Kaggle competition ☆25 · Updated 4 years ago
- DeepSpeech, Speech To Text, ASR, Speech recognition, Keras, TensorFlow ☆30 · Updated 7 years ago
- https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/ ☆21 · Updated 7 years ago
- speaker_diarization done on a toy dataset and tested on the TIMIT dataset ☆8 · Updated 3 years ago
- Tensor2tensor experiment with SpecAugment ☆46 · Updated 5 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data ☆96 · Updated last year
- Audio data augmentation examples ☆34 · Updated 6 years ago
- Easy-to-use Connectionist Temporal Classification in Keras ☆78 · Updated 3 years ago
- Some tutorials used for ASR class ☆31 · Updated 3 years ago
- Keras + PyTorch implementation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification" ☆29 · Updated 6 years ago
- Speech recognition with CTC in Keras with TensorFlow backend ☆31 · Updated 2 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition. ☆36 · Updated 5 years ago
- PyTorch end-to-end speech recognition ☆49 · Updated 4 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs… ☆31 · Updated 4 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP… ☆59 · Updated 4 years ago
- For our speech emotion recognition project ☆28 · Updated 4 years ago
- The repository for the Speech Recognition Israel meetup group. It is used for collecting and sharing material. ☆13 · Updated 4 years ago
- 1st Place Public Leaderboard Solution for ERC2019 ☆70 · Updated 5 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow. ☆72 · Updated 6 years ago
- ☆90 · Updated 2 years ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks ☆49 · Updated 3 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf ☆31 · Updated 3 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN) ☆61 · Updated 4 years ago