ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS)
☆10Mar 9, 2024Updated 2 years ago
Alternatives and similar repositories for ZET-Speech-Demo
Users that are interested in ZET-Speech-Demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆59Jun 20, 2024Updated last year
- Speech Recognition and Voice Activity Detection using a Convolutional Neural Network Architecture built with Tensorflow.js☆13Oct 24, 2021Updated 4 years ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆194Nov 9, 2022Updated 3 years ago
- ☆130Aug 19, 2024Updated last year
- BEGANSing - Korean SVS + SVC + AudioSR☆11Feb 17, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A waifu2x CLI wrapper.☆21Jun 8, 2015Updated 10 years ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.☆34Mar 11, 2025Updated last year
- ☆14Jun 16, 2023Updated 2 years ago
- Non Parallel Voice Conversion based on VITS☆24Mar 31, 2023Updated 3 years ago
- Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing☆89Sep 6, 2024Updated last year
- ☆11Dec 2, 2024Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆62Nov 1, 2024Updated last year
- 汽车-androidAPP-物联网-蓝牙☆11Nov 29, 2017Updated 8 years ago
- [ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor☆10Jul 10, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Adaptive Passage Encoder for Open-domain Question Answering☆15Jun 1, 2021Updated 4 years ago
- A simple python package to stretch audio files and change their speed☆12Feb 18, 2026Updated 2 months ago
- Deformable Convolutional Networks v2 with Pytorch☆10Jul 29, 2020Updated 5 years ago
- an smart ai waifu☆26Mar 18, 2023Updated 3 years ago
- A simple and effective feature alignment method with proposed anchor loss for person re-identification☆15Aug 18, 2020Updated 5 years ago
- A unified dataset of multilingual emotional human utterances☆28Jan 16, 2026Updated 3 months ago
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022☆13Oct 20, 2022Updated 3 years ago
- The goal is to pilot Microsoft Cognitive Services to unlock the strategic value of UN unstructured content by building on AI and semantic…☆16Jul 6, 2023Updated 2 years ago
- Emofilt is a program to simulate emotional arousal with speech synthesis based on the free-for-non-commercial-use MBROLA synthesis engine…☆14Mar 17, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This is an implementation of CartoonGAN in pytorch, including both ".py" and ".ipynb" version.☆12Nov 28, 2019Updated 6 years ago
- ☆11Aug 10, 2022Updated 3 years ago
- Manage audio and video datasets☆36Apr 16, 2026Updated 3 weeks ago
- This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i…☆14Jan 2, 2022Updated 4 years ago
- Dataset and model in the paper "SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation"☆13Feb 14, 2022Updated 4 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Mar 24, 2023Updated 3 years ago
- Webpage of "Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer"☆12Jul 2, 2024Updated last year
- Feature Extraction Approaches for Biological Sequences: A Comparative Study of Mathematical Models☆16Jul 6, 2023Updated 2 years ago
- Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer☆38Feb 17, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- Code for the paper 'Geodesic Finite Mixture Model'.☆10Aug 25, 2016Updated 9 years ago
- ☆16Nov 5, 2018Updated 7 years ago
- Pytorch Text GAN for lyrics generation☆10Apr 13, 2019Updated 7 years ago
- This repo is for CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering.☆14Mar 6, 2024Updated 2 years ago
- Learning Transferable Features with Deep Adaptation Networks☆13Jul 18, 2023Updated 2 years ago
- Code for the paper "Abstractive Summarization Guided by Latent Hierarchical Document Structure"☆13May 20, 2023Updated 2 years ago