gaganbahga / time_stretch
Time stretching audio without changing pitch
☆42Dec 13, 2020Updated 5 years ago
Alternatives and similar repositories for time_stretch
Users who are interested in time_stretch are comparing it to the libraries listed below.
- Modify the speed and pitch of a given audio file.☆21Apr 5, 2017Updated 8 years ago
- My implementation of Epoch-Synchronous Overlap-Add method for time stretching and pitch shifting.☆10Jan 25, 2020Updated 6 years ago
- LTFT-Phase-Vocoder is an audio effect that slows down an audio signal without dilating its frequency content or pitch.☆16Dec 19, 2020Updated 5 years ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 8 months ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Jun 27, 2018Updated 7 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", accepted at EUSIPCO 2022☆15Jun 18, 2022Updated 3 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆45Nov 13, 2019Updated 6 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- ☆18Feb 9, 2020Updated 6 years ago
- tesseractXplore: an easy-to-use Tesseract GUI with full control☆27Nov 10, 2021Updated 4 years ago
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer"☆21Jun 23, 2022Updated 3 years ago
- An unofficial vits2-TTS implementation in PyTorch, adapted to support 44100 Hz Japanese audio.☆24Sep 1, 2023Updated 2 years ago
- WaveRNN-based waveform generator & demo of TensorFlow CuDNN-GRU usage.☆24Aug 19, 2018Updated 7 years ago
- ☆21Jan 12, 2021Updated 5 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Sep 1, 2020Updated 5 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Apr 23, 2024Updated last year
- TTS Text Analyzer☆32Jul 20, 2023Updated 2 years ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023☆27Apr 27, 2023Updated 2 years ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- Korean Singing Voice Synthesis based on Auto-regressive Boundary Equilibrium GAN☆67Apr 26, 2021Updated 4 years ago
- Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.☆29Jul 30, 2023Updated 2 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- ☆11Aug 11, 2021Updated 4 years ago
- BioAmp is an open-source project of a multichannel biopotential acquisition system for EEG, EMG, EOG and EOG signals.☆15Apr 11, 2022Updated 3 years ago
- The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.☆127Sep 7, 2021Updated 4 years ago
- Audio processing module for PyTorch: STFT, ISTFT☆36Aug 15, 2019Updated 6 years ago
- PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis☆73Aug 3, 2021Updated 4 years ago
- Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.☆140Sep 25, 2024Updated last year
- Code for reproducing the paper "Neural Networks Fail to Learn Periodic Functions and How to Fix It" as part of the ML Reproducibility Cha…☆11Apr 16, 2021Updated 4 years ago
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- P-sort is an open-source, Python-based, cross-platform software with an intuitive GUI. It has been designed to address the challenges of …☆11Jan 23, 2024Updated 2 years ago
- Pitch-shifting and time-stretching with TD-PSOLA☆88Aug 16, 2023Updated 2 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 3 years ago
- Network specification and demo☆35Jun 5, 2017Updated 8 years ago
- Pitch shifter using WSOLA and resampling, implemented in Python 3☆39Jul 19, 2017Updated 8 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆32May 30, 2018Updated 7 years ago