krsna6 / interactive-embedding-space
Embed media in a 2D scatter plot.
☆15Updated 4 years ago
Alternatives and similar repositories for interactive-embedding-space:
Users that are interested in interactive-embedding-space are comparing it to the libraries listed below
- ☆129Updated 4 months ago
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆127Updated 3 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆88Updated last year
- Code associated with the paper: Neural Representations for Modeling Variation in Speech.☆17Updated 2 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- ☆185Updated 8 months ago
- ☆10Updated 2 months ago
- Self-supervised VQ-VAE for One-Shot Music Style Transfer☆86Updated last year
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆79Updated 3 years ago
- A deep learning model for classifying audio frames into [SPEECH, KCHI, CHI, MAL, FEM] classes.☆43Updated 4 months ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 4 years ago
- ☆28Updated 4 years ago
- A Python toolbox for speech features extraction☆160Updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- ☆27Updated 2 years ago
- Authors' implementation of DeepSpeech Distances.☆129Updated 4 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆253Updated 2 years ago
- PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate autioset category ontology.☆69Updated 3 years ago
- Python package for openSMILE☆259Updated last month
- This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-…☆123Updated 4 years ago
- Repository for Quantifying Valence and Arousal in Text with Multilingual Pre-trained Transformers☆25Updated last year
- Perform transfer learning for MIR using Jukebox!☆174Updated last year
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- Extract frequency, power, width and dissonance of formants from wav files☆25Updated 2 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆333Updated last year
- Audio transformations library for PyTorch☆229Updated 2 years ago
- Code accompanying ISMIR'19 paper titled "Learning to Traverse Latent Spaces for Musical Score Inpaintning"☆46Updated 3 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech☆128Updated 4 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆240Updated 5 years ago