ARIA-VALUSPA / AVP
This is the ARIA-VALUSPA Platform, or AVP for short. Use this platform to build your own Virtual Humans with audio-visual input and output, language models for English, French, and German, emotional understanding, and many more. This work was funded by European Union Horizon 2020 research and innovation programme, grant agreement No 645378.
☆32Updated 4 years ago
Alternatives and similar repositories for AVP:
Users that are interested in AVP are comparing it to the libraries listed below
- A classification model in Machine Learning capable of recognizing human facial emotions☆23Updated 6 years ago
- Tool for online Valence and Arousal annotation.☆35Updated 4 years ago
- Build your own Real-time Speech Emotion Recognizer☆111Updated 6 years ago
- Deep understanding and modelling of the hierarchical structure of prosody☆22Updated 5 years ago
- Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman.☆51Updated 6 years ago
- Code to run densepose on video with detectron. https://github.com/facebookresearch/Detectron☆62Updated 6 years ago
- ☆25Updated 6 years ago
- An end-to-end MATLAB toolkit for completely unsupervised Speaker Diarization using state-of-the-art algorithms.☆16Updated 9 years ago
- A Super Wizard of OZ platform☆19Updated 8 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆20Updated 8 years ago
- Representations of language in a model of visually grounded speech signal.☆23Updated 6 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Updated 8 months ago
- ☆29Updated 10 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆45Updated 4 years ago
- Read and write HTK and HTS files from python.☆20Updated 10 years ago
- Social Signal Interpretation (SSI) Framework☆62Updated last year
- ☆64Updated 6 years ago
- Caffe implementation of face recognition using VGG Deep Face☆1Updated 7 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆86Updated 4 years ago
- A project where we use tf-open-pose to run 3D pose estimation in realtime .☆56Updated 7 years ago
- Keras Implementation of "Look, Listen and Learn" Model☆21Updated 7 years ago
- Audio-Visual Speech Recognition using Deep Learning☆60Updated 6 years ago
- ☆21Updated 7 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 4 years ago
- WaveNet Vocoder Samples☆23Updated 5 years ago
- Deep CNN networks for Speech Synthesis☆49Updated 7 years ago
- Audio Classification using Image Classification☆48Updated 5 years ago
- Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17☆116Updated 8 years ago
- Face2Faceの実装とか☆13Updated 8 years ago
- Speech-conditioned face generation using Generative Adversarial Networks☆88Updated 2 years ago