blairstar / The_Art_of_DPM

An In-depth Analysis of Diffusion Probability Model

☆119

Alternatives and similar repositories for The_Art_of_DPM

Users who are interested in The_Art_of_DPM are comparing it to the repositories listed below.

VoiceBank-NTPU-TW / VoiceBank-2023
VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.
☆41 · Jan 4, 2026 · Updated last month
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26 · Aug 5, 2024 · Updated last year
light1726 / BetaVAE_VC
Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using β-VAE"
☆44 · Apr 10, 2023 · Updated 2 years ago
jdh-algo / JoyTTS
☆40 · Jul 15, 2025 · Updated 7 months ago
Honee-W / U-SAM
Official repository for U-SAM (Interspeech 2025)
☆25 · Jun 3, 2025 · Updated 8 months ago
cszhilu1998 / TBSR
This is the official PyTorch implementation of TBSR. Our team received 2nd place (real data track) and 3rd place (synthetic track) in NTI…
☆14 · Jun 11, 2022 · Updated 3 years ago
e-c-k-e-r / vall-e
An unofficial PyTorch implementation of VALL-E
☆88 · Aug 3, 2025 · Updated 6 months ago
csmliu / pretrained-GANs
A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration
☆17 · Jul 22, 2022 · Updated 3 years ago
ddPn08 / Latopia
Speech AI training and inference tools
☆36 · Jun 25, 2023 · Updated 2 years ago
rockingdingo / context_recommendation
Contextual Recommendation Implementation for Research Purposes
☆19 · Jul 3, 2024 · Updated last year
StacyYang / AutoTorch
AutoTorch, an HPO Toolkit
☆60 · May 25, 2020 · Updated 5 years ago
keonlee9420 / Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…
☆146 · Jun 6, 2022 · Updated 3 years ago
adelacvg / diff-vits
☆39 · Oct 1, 2023 · Updated 2 years ago
happycaoyue / JSPL
☆17 · Mar 24, 2022 · Updated 3 years ago
lslrh / SyncNoise
SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing
☆18 · Dec 28, 2024 · Updated last year
thunlp / SparsingLaw
The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".
☆30 · Nov 12, 2024 · Updated last year
csmliu / AdaNEC
☆22 · Apr 4, 2022 · Updated 3 years ago
ryanrudes / YTTTS
The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
☆52 · Apr 1, 2021 · Updated 4 years ago
pengzhendong / audio-pipeline
☆23 · Oct 17, 2024 · Updated last year
tonnetonne814 / unofficial-vits2-44100-Ja
An unofficial vits2-TTS implementation in PyTorch, adapted to support 44100 Hz Japanese audio sources.
☆24 · Sep 1, 2023 · Updated 2 years ago
iaramer / dobbi
An open-source NLP library: fast text cleaning and preprocessing
☆23 · Nov 9, 2021 · Updated 4 years ago
ftshijt / speech_evaluation
A toolkit dedicated to speech evaluation.
☆24 · Sep 26, 2024 · Updated last year
anton-kashkin / hifi_vc
☆25 · Jan 24, 2023 · Updated 3 years ago
neoncloud / mdctGAN
Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"
☆66 · Jun 3, 2023 · Updated 2 years ago
TencentARC-QQ / QA-CLIP
Chinese CLIP models with SOTA performance.
☆60 · Aug 28, 2023 · Updated 2 years ago
wenet-e2e / llm-papers
List of Large Language Model Papers
☆60 · Jun 5, 2023 · Updated 2 years ago
NeuroWave-ai / CUCVAE-TTS
☆25 · Mar 12, 2022 · Updated 3 years ago
vtuber-plan / vcvits
Non-Parallel Voice Conversion based on VITS
☆24 · Mar 31, 2023 · Updated 2 years ago
p0p4k / vits3_pytorch
☆28 · Nov 15, 2023 · Updated 2 years ago
jaechanjo / TIFF
Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation
☆24 · Jun 24, 2024 · Updated last year
ML-GSAI / SDE-Drag
Official PyTorch implementation for ICLR2024 paper "The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing"
☆110 · Feb 26, 2024 · Updated last year
adelacvg / ttts
Train the next generation of TTS systems.
☆171 · Sep 13, 2024 · Updated last year
hpc203 / Real-Time-Frame-Interpolation-onnxrun
Real-time video frame interpolation deployed with onnxruntime, including both C++ and Python versions of the program
☆28 · Feb 14, 2024 · Updated 2 years ago
yunyikristy / ttsGAN-ICLR2019
☆25 · Apr 24, 2019 · Updated 6 years ago
wutong16 / FiVA
[NeurIPS 2024 D&B Track] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"
☆73 · Dec 27, 2024 · Updated last year
xmu-xiaoma666 / X-Mesh
A PyTorch implementation of “X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance”
☆29 · Jan 12, 2024 · Updated 2 years ago
Wang-pengfei / GGSD
Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024
☆30 · Jul 19, 2024 · Updated last year
jishengpeng / TextrolSpeech
[ICASSP 2024] TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models
☆183 · Nov 22, 2024 · Updated last year
fishaudio / audio-preprocess
Preprocess Audio for training
☆374 · Feb 2, 2026 · Updated 2 weeks ago