satvik-venkatesh / you-only-hear-onceView external linksLinks
☆101Oct 13, 2022Updated 3 years ago
Alternatives and similar repositories for you-only-hear-once
Users that are interested in you-only-hear-once are comparing it to the libraries listed below
Sorting:
- Repository for the paper "Towards duration robust weakly supervised sound event detection"☆23Aug 3, 2023Updated 2 years ago
- Functions for creating speech features in MATLAB.☆13Jul 7, 2020Updated 5 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- ☆67Sep 13, 2024Updated last year
- ☆54Jun 3, 2020Updated 5 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆14Aug 22, 2023Updated 2 years ago
- ☆95Jun 22, 2023Updated 2 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Jun 18, 2022Updated 3 years ago
- Python implementation of a few speech intelligibility prediction algorithms☆15May 29, 2024Updated last year
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 2 years ago
- The code for DCASE2021 task5 submission.☆20Feb 21, 2022Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Paderborn Sound Event Detection☆78Jul 18, 2023Updated 2 years ago
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆22Sep 21, 2021Updated 4 years ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆414Aug 14, 2022Updated 3 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Jul 23, 2023Updated 2 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- Easy to use Audio Tagging in PyTorch☆23Aug 22, 2021Updated 4 years ago
- Learning differentiable temporal resolution on time-series data.☆36Nov 12, 2022Updated 3 years ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆76Jul 29, 2024Updated last year
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Jul 24, 2023Updated 2 years ago
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆26Jan 11, 2022Updated 4 years ago
- 44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。☆24Sep 1, 2023Updated 2 years ago
- Evaluation toolbox for Sound Event Detection☆157Jun 12, 2024Updated last year
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- demo page https://MingjieChen.github.io/dygan-vc☆67Apr 13, 2022Updated 3 years ago
- multi-scale time domain speaker extraction☆71Jun 7, 2021Updated 4 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated last year
- NEAL (Nature+Energy Audio Labeller) is an open-source interactive audio data annotation tool.☆16Apr 7, 2025Updated 10 months ago
- ☆13Nov 22, 2022Updated 3 years ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Nov 5, 2020Updated 5 years ago
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆104Jul 12, 2023Updated 2 years ago