Emrys365 / torch_stft
PyTorch-based implementations of short-time Fourier transform
☆15Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for torch_stft
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Updated 3 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆25Updated 4 months ago
- Spherical residual vector quantization (SRVQ)☆26Updated 3 months ago
- Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models☆15Updated this week
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 2 months ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- real-time speech enhance☆12Updated 10 months ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆11Updated 3 months ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆39Updated last year
- video cut powered by AI☆25Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆41Updated 5 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆27Updated 3 months ago
- SRTNet☆24Updated last year
- A purely header only c version of hifi-gan☆8Updated 3 years ago
- A neural speech codec based on discrete WavLM representations☆21Updated 2 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆27Updated 2 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆22Updated 7 months ago
- A small tool to calculate the distribution of audio durations in a directory☆14Updated last year
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆59Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆11Updated 4 months ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated last year
- ☆13Updated 4 months ago
- Aligner for text-to-speech☆15Updated 4 months ago
- his code is a pytorch version for CycleFlow model in "CycleFlow: Purify Information Factors by Cycle Loss"☆14Updated 2 years ago
- with alignment learning and continuous wavelet transform☆19Updated 2 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- Official implementation of Self-Remixing☆11Updated 9 months ago
- ☆20Updated last month