apoorvpatne10 / fantastic-memory
Lip Reading using STCNNs and Bi-GRUs
☆10Updated 2 years ago
Alternatives and similar repositories for fantastic-memory:
Users that are interested in fantastic-memory are comparing it to the libraries listed below
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 4 years ago
- SpeechYOLO Interspeech 2019☆42Updated 2 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 5 years ago
- ☆23Updated 5 years ago
- (Si)mply a (Re)search front-end for Text-To-Speech Synthesis.☆10Updated 6 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆29Updated last year
- Detect emotion from audio☆13Updated 6 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 2 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features☆25Updated 5 months ago
- The python code detects different landmarks on the face and predicts the emotions such as smile based on it. It automatically takes a pho…☆46Updated 6 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 4 years ago
- Audio command recognition by DTW and classification☆7Updated 4 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆47Updated 7 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 3 years ago
- Audio data augmentation examples☆34Updated 6 years ago
- Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception)☆58Updated last year
- ☆41Updated 4 months ago
- Construct GMM-HMM and Implement the Viterbi algorithm for continuous speech recognition☆15Updated 6 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆25Updated 4 years ago
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network☆10Updated 6 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- Predicting various emotion in human speech signal by detecting different speech components affected by human emotion.☆45Updated 5 months ago
- Sorce code of Apkinson: android app to monitor the motor symptoms of Parkinson's patients☆17Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 2 years ago