krsna6 / interactive-embedding-spaceLinks
Embed media in a 2D scatter plot.
☆16Updated 5 years ago
Alternatives and similar repositories for interactive-embedding-space
Users that are interested in interactive-embedding-space are comparing it to the libraries listed below
Sorting:
- ☆138Updated last year
- Wav2Vec for speech recognition, classification, and audio classification☆267Updated 3 years ago
- Python package for openSMILE☆291Updated 2 months ago
- A deep learning model for classifying audio frames into [SPEECH, KCHI, CHI, MAL, FEM] classes.☆47Updated 3 months ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆108Updated last year
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆136Updated 9 months ago
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3☆107Updated last year
- ☆13Updated 10 months ago
- This is the GitHub page for publicly available emotional speech data.☆367Updated 3 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆379Updated last year
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆132Updated 3 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆160Updated this week
- A python library to generate speech dataset from Youtube videos☆36Updated last year
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…☆136Updated 3 years ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.☆51Updated 2 years ago
- Identify the emotion of multiple speakers in an Audio Segment☆178Updated 2 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆339Updated 2 years ago
- ☆29Updated 3 years ago
- A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech☆458Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆65Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- A unified dataset of multilingual emotional human utterances☆28Updated 3 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆90Updated 6 months ago
- These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth☆42Updated 4 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆82Updated 3 years ago
- 1st Place Public Leaderboard Solution for ERC2019☆70Updated 5 years ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech☆341Updated 3 years ago
- ☆193Updated last year
- This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-…☆125Updated 4 years ago