arda-num / SFSRNetLinks
Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate Project_Ates_Numanoglu folder to see.
☆11Updated 3 years ago
Alternatives and similar repositories for SFSRNet
Users that are interested in SFSRNet are comparing it to the libraries listed below
Sorting:
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆42Updated 3 years ago
- Deep Speech Distances PyTorch☆29Updated 3 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆14Updated 2 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Updated 3 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 3 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Updated 3 years ago
- ☆33Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Updated last year
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Updated 2 years ago
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆16Updated 3 years ago
- This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆33Updated 3 years ago
- ☆32Updated 3 years ago
- PyTorch implementation of Continuous Speech Separation☆13Updated 2 years ago
- ☆22Updated 4 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆31Updated 3 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆76Updated 2 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆75Updated 4 years ago
- ☆56Updated 2 years ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 3 years ago
- ☆64Updated 3 years ago
- PodcastMix A dataset for separating music and speech in podcasts.☆44Updated last year
- Streaming Audiotransformers for online Audio tagging☆47Updated last year
- Synthesized singing voice demos of WeSinger 2 paper.☆27Updated 2 years ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆55Updated last year
- Open Source Speech/Text Data on AI☆18Updated 3 years ago
- Temporary anonymous version☆22Updated last year