Python Implementation of Visual Relative Attributes for Image Classification and Zero Shot Learning
☆22Jun 14, 2018Updated 7 years ago
Alternatives and similar repositories for Relative-Attributes-Zero-Shot-Learning
Users that are interested in Relative-Attributes-Zero-Shot-Learning are comparing it to the libraries listed below
Sorting:
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 5 months ago
- This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".☆95Feb 9, 2022Updated 4 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆89Mar 5, 2022Updated 4 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Mar 29, 2021Updated 4 years ago
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆83Nov 4, 2022Updated 3 years ago
- ☆10Apr 8, 2024Updated last year
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- ☆12Mar 24, 2023Updated 2 years ago
- ☆13Nov 25, 2023Updated 2 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- ☆17Mar 24, 2022Updated 3 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- tacotronV2 + wavernn 实现中文语音合成(Tensorflow + pytorch)☆15May 20, 2020Updated 5 years ago
- Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…☆18Oct 27, 2020Updated 5 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Nov 13, 2020Updated 5 years ago
- ☆77Apr 26, 2022Updated 3 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Aug 30, 2019Updated 6 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆87Dec 31, 2022Updated 3 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆248Mar 24, 2023Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆87Dec 20, 2022Updated 3 years ago
- ☆197May 3, 2024Updated last year
- This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-…☆125Dec 14, 2020Updated 5 years ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Oct 19, 2022Updated 3 years ago
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…☆137Oct 24, 2021Updated 4 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆52Jun 17, 2025Updated 8 months ago
- Hangul pronunciation and romanisation based on Wiktionary ko-pron lua module☆21Oct 17, 2018Updated 7 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 3 years ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆146Jun 6, 2022Updated 3 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆54Sep 14, 2022Updated 3 years ago
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆53Jun 2, 2020Updated 5 years ago
- ☆25Apr 24, 2019Updated 6 years ago
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆33Jun 14, 2024Updated last year
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆360Apr 27, 2022Updated 3 years ago
- ☆64May 23, 2022Updated 3 years ago
- Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…☆27Feb 27, 2026Updated last week
- The official repository for Audio ALBERT☆67Jan 21, 2022Updated 4 years ago
- ☆112Jun 11, 2021Updated 4 years ago