LukeSutor / programmatic-pitchLinks

High fidelity music synthesis using diffusion and UnivNet.

☆9

Alternatives and similar repositories for programmatic-pitch

Users that are interested in programmatic-pitch are comparing it to the libraries listed below

Sorting:

merlresearch / reverberation-as-supervision
Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation
☆12Updated 10 months ago
KdaiP / conformer-RoPE
Conformer block with Rotary Position Embedding, modified from lucidrains' implement
☆14Updated 9 months ago
naba89 / iSeparate-SDX
iSeparate library for the SDX2023 challenge
☆13Updated last year
vtuber-plan / hifi-gan
An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.
☆31Updated 2 years ago
sushant-t / tts-trainer
Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…
☆28Updated 2 years ago
uthree / ddsp-vocoder
☆10Updated 7 months ago
will-rice / denoisers
Simple PyTorch Denoisers for Waveform Audio
☆35Updated 2 months ago
rishikksh20 / NU-Wave2-pytorch
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆24Updated 2 years ago
diggerdu / AudioMamba
☆10Updated last year
adefossez / sdx23
SDX23 startkit for the Demucs baselines.
☆28Updated 2 years ago
rom1504 / audio2dataset
Easily turn large sets of audio urls to an audio dataset.
☆21Updated 2 years ago
ShovalMessica / NAST
Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…
☆46Updated 11 months ago
Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune
This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …
☆12Updated 6 months ago
voidful / vall-e-encodec
☆41Updated 2 years ago
Nathan-Roll1 / PSST
Prosodic Speech Segmentation with Transformers
☆25Updated last year
30stomercury / hmm-backprop
Fast and differentiable hidden Markov model in C++
☆17Updated 2 years ago
ORI-Muchim / AudioSR-Upsampling
AudioSR-Upsampling (any -> 48kHz)
☆41Updated last year
vtuber-plan / FlowVAE
☆13Updated last year
ex3ndr / supervoice-gpt-facodec
GPT for FACodec
☆13Updated last year
kyegomez / Audio-xLSTMs
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆18Updated last week
0417keito / UTAUTAI
UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)
☆12Updated last year
bastibe / Replication-Dataset-Scripts
Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …
☆10Updated 4 years ago
eloimoliner / audio-inpainting-diffusion
☆67Updated last year
Sosdatasets / SoS_Dataset
☆11Updated 11 months ago
asigalov61 / Heptabit-Music-Transformer
[DEPRECIATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume…
☆15Updated last year
Ereboas / TacoLM
☆19Updated last year
reppy4620 / x-vits
☆13Updated 8 months ago
Bartelds / ctc-dro
Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.
☆15Updated last month
zjlww / ardit-web
☆25Updated 10 months ago
duerig / StyleTTS2
StyleTTS 2 Optimized Training Fork
☆31Updated 4 months ago