BrightGu / RLVC
☆14Updated last year
Alternatives and similar repositories for RLVC:
Users who are interested in RLVC are comparing it to the libraries listed below
- Supervoice Speaker Separation Network☆12Updated 7 months ago
- ☆23Updated last year
- StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and imp…☆10Updated last year
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated last year
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated last year
- Hifi-like Vocoder implemented in PyTorch☆13Updated 2 years ago
- ☆10Updated last year
- ☆10Updated 2 months ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆10Updated 2 weeks ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆11Updated 5 months ago
- audiomod is a project for audio modifications, including audio manipulators such as time-stretching, pitch-shifting, formant-changing, and…☆3Updated 5 months ago
- Phonemes and durations labeling based on whisper small☆11Updated 6 months ago
- This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.☆11Updated 3 years ago
- GroupMap: beyond mean and variance matching for deep learning☆10Updated 2 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆16Updated last week
- Any-to-one voice conversion using a data augmentation strategy: pitch is shifted while duration is preserved.☆33Updated 3 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆19Updated 3 months ago
- ☆22Updated 2 years ago
- ☆10Updated 9 months ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless, and TortoiseTTS into one☆27Updated 5 months ago
- ☆16Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- A purely header-only C version of HiFi-GAN☆9Updated 3 years ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆12Updated last month
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆10Updated last year
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…☆10Updated last month
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆10Updated last month
- Repository of published DNN speech separation recipes for a number of datasets☆10Updated 11 months ago