ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS)
☆10Mar 9, 2024Updated 2 years ago
Alternatives and similar repositories for ZET-Speech-Demo
Users that are interested in ZET-Speech-Demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆59Jun 20, 2024Updated last year
- Speech Recognition and Voice Activity Detection using a Convolutional Neural Network Architecture built with Tensorflow.js☆13Oct 24, 2021Updated 4 years ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆194Nov 9, 2022Updated 3 years ago
- ☆130Aug 19, 2024Updated last year
- BEGANSing - Korean SVS + SVC + AudioSR☆11Feb 17, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A waifu2x CLI wrapper.☆21Jun 8, 2015Updated 10 years ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.☆33Mar 11, 2025Updated last year
- ☆14Jun 16, 2023Updated 2 years ago
- Non Parallel Voice Conversion based on VITS☆24Mar 31, 2023Updated 2 years ago
- Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing☆89Sep 6, 2024Updated last year
- ☆11Dec 2, 2024Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆62Nov 1, 2024Updated last year
- 汽车-androidAPP-物联网-蓝牙☆11Nov 29, 2017Updated 8 years ago
- Deformable Convolutional Networks v2 with Pytorch☆10Jul 29, 2020Updated 5 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Adaptive Passage Encoder for Open-domain Question Answering☆15Jun 1, 2021Updated 4 years ago
- A simple python package to stretch audio files and change their speed☆12Feb 18, 2026Updated last month
- an smart ai waifu☆26Mar 18, 2023Updated 3 years ago
- A simple and effective feature alignment method with proposed anchor loss for person re-identification☆15Aug 18, 2020Updated 5 years ago
- A unified dataset of multilingual emotional human utterances☆28Jan 16, 2026Updated 2 months ago
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022☆13Oct 20, 2022Updated 3 years ago
- The goal is to pilot Microsoft Cognitive Services to unlock the strategic value of UN unstructured content by building on AI and semantic…☆16Jul 6, 2023Updated 2 years ago
- Emofilt is a program to simulate emotional arousal with speech synthesis based on the free-for-non-commercial-use MBROLA synthesis engine…☆14Mar 17, 2022Updated 4 years ago
- This is an implementation of CartoonGAN in pytorch, including both ".py" and ".ipynb" version.☆12Nov 28, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆11Aug 10, 2022Updated 3 years ago
- This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i…☆14Jan 2, 2022Updated 4 years ago
- Manage audio and video datasets☆35Mar 4, 2026Updated 3 weeks ago
- Dataset and model in the paper "SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation"☆13Feb 14, 2022Updated 4 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Mar 24, 2023Updated 3 years ago
- Webpage of "Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer"☆11Jul 2, 2024Updated last year
- Feature Extraction Approaches for Biological Sequences: A Comparative Study of Mathematical Models☆16Jul 6, 2023Updated 2 years ago
- Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer☆38Feb 17, 2025Updated last year
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Code for the paper 'Geodesic Finite Mixture Model'.☆10Aug 25, 2016Updated 9 years ago
- ☆16Nov 5, 2018Updated 7 years ago
- Pytorch Text GAN for lyrics generation☆10Apr 13, 2019Updated 6 years ago
- Learning Transferable Features with Deep Adaptation Networks☆12Jul 18, 2023Updated 2 years ago
- This repo is for CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering.☆14Mar 6, 2024Updated 2 years ago
- ☆11Mar 28, 2021Updated 5 years ago
- Code for the paper "Abstractive Summarization Guided by Latent Hierarchical Document Structure"☆13May 20, 2023Updated 2 years ago