Python Implementation of Visual Relative Attributes for Image Classification and Zero Shot Learning
☆22Jun 14, 2018Updated 7 years ago
Alternatives and similar repositories for Relative-Attributes-Zero-Shot-Learning
Users that are interested in Relative-Attributes-Zero-Shot-Learning are comparing it to the libraries listed below.
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 6 months ago
- This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".☆95Feb 9, 2022Updated 4 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆90Mar 5, 2022Updated 4 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Mar 29, 2021Updated 4 years ago
- ☆12Mar 24, 2023Updated 3 years ago
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆83Nov 4, 2022Updated 3 years ago
- Chinese speech synthesis with tacotronV2 + wavernn (Tensorflow + pytorch)☆15May 20, 2020Updated 5 years ago
- ☆12Nov 25, 2023Updated 2 years ago
- Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…☆18Oct 27, 2020Updated 5 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Nov 13, 2020Updated 5 years ago
- ☆17Mar 24, 2022Updated 4 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago
- ☆198May 3, 2024Updated last year
- ☆10Apr 8, 2024Updated last year
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 3 years ago
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆53Jun 2, 2020Updated 5 years ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆87Dec 20, 2022Updated 3 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆87Dec 31, 2022Updated 3 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Aug 30, 2019Updated 6 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆248Mar 24, 2023Updated 3 years ago
- New algorithms for Large-scale Collaborative Ranking: PrimalCR and PrimalCR++☆12Sep 1, 2017Updated 8 years ago
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…☆137Oct 24, 2021Updated 4 years ago
- This is the implementation of the Speaker Odyssey 2020 paper "Transforming spectrum and prosody for emotional voice conversion with non-…☆125Dec 14, 2020Updated 5 years ago
- ☆64May 23, 2022Updated 3 years ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆147Jun 6, 2022Updated 3 years ago
- ☆77Apr 26, 2022Updated 3 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆360Apr 27, 2022Updated 3 years ago
- The official repository for Audio ALBERT☆67Jan 21, 2022Updated 4 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Nov 12, 2019Updated 6 years ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Oct 19, 2022Updated 3 years ago
- knowledge graph recommendation☆14Jun 13, 2019Updated 6 years ago
- ☆11May 7, 2022Updated 3 years ago
- Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…☆27Feb 27, 2026Updated 3 weeks ago
- ☆25Mar 12, 2022Updated 4 years ago
- ☆13Nov 15, 2016Updated 9 years ago