LukeSutor / programmatic-pitchLinks
High fidelity music synthesis using diffusion and UnivNet.
☆9Updated last year
Alternatives and similar repositories for programmatic-pitch
Users that are interested in programmatic-pitch are comparing it to the libraries listed below
Sorting:
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆12Updated 10 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆14Updated 9 months ago
- iSeparate library for the SDX2023 challenge☆13Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- ☆10Updated 7 months ago
- Simple PyTorch Denoisers for Waveform Audio☆35Updated 2 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- ☆10Updated last year
- SDX23 startkit for the Demucs baselines.☆28Updated 2 years ago
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 2 years ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Updated 11 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆12Updated 6 months ago
- ☆41Updated 2 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- Fast and differentiable hidden Markov model in C++☆17Updated 2 years ago
- AudioSR-Upsampling (any -> 48kHz)☆41Updated last year
- ☆13Updated last year
- GPT for FACodec☆13Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated last week
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆12Updated last year
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆10Updated 4 years ago
- ☆67Updated last year
- ☆11Updated 11 months ago
- [DEPRECIATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume…☆15Updated last year
- ☆19Updated last year
- ☆13Updated 8 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated last month
- ☆25Updated 10 months ago
- StyleTTS 2 Optimized Training Fork☆31Updated 4 months ago