xinshengwang/S2IGAN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xinshengwang/S2IGAN)

xinshengwang / S2IGAN

Pytorch Code for S2IGAN

☆40

Alternatives and similar repositories for S2IGAN

Users that are interested in S2IGAN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rgzn-aiyun / melgan-cpu
View on GitHub
Real-time melgan based on cpu ！！！
☆13Dec 3, 2019Updated 6 years ago
dipjyoti92 / speaker_embeddings_GE2E
View on GitHub
PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification
☆28Jan 23, 2021Updated 5 years ago
rosinality / melgan-pytorch
View on GitHub
MelGAN and Tacotron 2 in PyTorch
☆11Oct 22, 2019Updated 6 years ago
smallflyingpig / speech-to-image-translation-without-text
View on GitHub
Code for paper "direct speech-to-image translation"
☆26Jun 8, 2020Updated 6 years ago
shashankshirol / GeneratingNoisySpeechData
View on GitHub
A repository comprising of code for generation of noisy speech data from clean data using deep learning methods
☆16Jul 12, 2021Updated 5 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
keonlee9420 / WaveGrad2
View on GitHub
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
☆68Aug 3, 2021Updated 4 years ago
duyichao / E2E-ST-TDA
View on GitHub
Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"
☆17Dec 23, 2021Updated 4 years ago
Rongjiehuang / awesome-speech-to-speech-translation
View on GitHub
List of direct speech-to-speech translation papers.
☆39Jan 31, 2023Updated 3 years ago
rishikksh20 / UnivNet-pytorch
View on GitHub
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
☆76Aug 30, 2021Updated 4 years ago
LEEYOONHYUNG / GraphTTS
View on GitHub
☆12Jul 6, 2023Updated 3 years ago
karchkha / MelSpec_GPT_VQVAE
View on GitHub
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Oct 8, 2023Updated 2 years ago
xinshengwang / ICASSP2021_paper_list-VC
View on GitHub
ICASSP 2021 accepted papers in term of voice conversion (VC)
☆18Apr 11, 2021Updated 5 years ago
zipengxuc / PPE
View on GitHub
Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…
☆37Apr 13, 2022Updated 4 years ago
talkiq / llm-evaluate
View on GitHub
☆11Nov 12, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
liuhuang31 / g2pw_once
View on GitHub
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14Dec 30, 2023Updated 2 years ago
maum-ai / wavegrad2
View on GitHub
Unofficial Pytorch Implementation of WaveGrad2
☆111Aug 18, 2021Updated 4 years ago
dipjyoti92 / SC-WaveRNN
View on GitHub
Official PyTorch implementation of Speaker Conditional WaveRNN
☆110Jun 22, 2022Updated 4 years ago
rynmurdock / CPPN-WGAN-GP
View on GitHub
A WGAN-GP that utilizes a compositional pattern producing network as the generator
☆11Sep 9, 2021Updated 4 years ago
georgid / lakh_vocal_segments_dataset
View on GitHub
singing voice with annotations of vocal onsets, based on the matched MIDI from http://colinraffel.com/projects/lmd/
☆20Dec 30, 2019Updated 6 years ago
miqueltubau / Wav2Pix
View on GitHub
Speech-conditioned face generation using Generative Adversarial Networks
☆88Dec 8, 2022Updated 3 years ago
safwankdb / Neural-Style-Transfer
View on GitHub
PyTorch implementation of A Neural Algorithm of Artistic Style
☆10Dec 20, 2019Updated 6 years ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
huiyegit / T2I_CL
View on GitHub
☆45Dec 26, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
rgzn-aiyun / tacotron2-melgan
View on GitHub
Mel spectrum based on tacotron2 for melgan speech synthesis
☆15Mar 24, 2023Updated 3 years ago
hiarsal / DAE-GAN
View on GitHub
☆26Mar 31, 2022Updated 4 years ago
cnlinxi / tpse_tacotron2
View on GitHub
TPSE-GST Tacotron2
☆14May 1, 2019Updated 7 years ago
KunZhou9646 / controllable_evc_code
View on GitHub
This is the code for controllable EVC framework for seen and unseen emotion generation.
☆45Nov 3, 2021Updated 4 years ago
monglechap / fluenttts
View on GitHub
FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS
☆20Nov 15, 2022Updated 3 years ago
neuralchen / CooGAN
View on GitHub
The official tensorflow implementation of "CooGAN: A Memory-Efficient Framework for High-Resolution Facial Attribute Editing" (Accepted i…
☆26Mar 19, 2022Updated 4 years ago
vliu15 / adversarial-tts
View on GitHub
End-to-end Text-to-Speech with Generative Adversarial Networks
☆20Feb 6, 2021Updated 5 years ago
bastibe / MAPS-Scripts
View on GitHub
A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.
☆25Mar 29, 2021Updated 5 years ago
noc-turne / LLM_Light_Testing
View on GitHub
本项目提出了一个基于python的大语言模型推理服务自动化测试框架，用于评估大语言模型的推理效果以及性能，具有易用性、易拓展性、高效性和可靠性等特点。
☆11Feb 26, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cxysteven / MapBJ
View on GitHub
☆12Apr 13, 2017Updated 9 years ago
theolepage / ssl-for-slr
View on GitHub
Collection of self-supervised models for speaker and language recognition tasks.
☆19Jan 18, 2022Updated 4 years ago
rishikksh20 / VocGAN
View on GitHub
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
☆321Jul 25, 2024Updated 2 years ago
MattShannon / mcd
View on GitHub
Mel cepstral distortion (MCD) computations in python.
☆231Jun 13, 2017Updated 9 years ago
mutiann / speech_rankings
View on GitHub
A CSRankings-like index for speech researchers
☆35Oct 16, 2024Updated last year
bzhangGo / st_from_scratch
View on GitHub
Revisiting End-to-End Speech-to-Text Translation From Scratch
☆13Feb 21, 2023Updated 3 years ago
mir-aidj / neural-loop-combiner
View on GitHub
"Neural Loop Combiner: Neural Network Models For Assessing The Compatibility of Loops", ISMIR 2020
☆33Nov 8, 2020Updated 5 years ago