LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement
☆16Jul 11, 2025Updated 8 months ago
Alternatives and similar repositories for LLaSE
Users that are interested in LLaSE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆102Apr 1, 2025Updated 11 months ago
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆44Mar 3, 2025Updated last year
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆46Mar 10, 2025Updated last year
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆15Aug 22, 2023Updated 2 years ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM☆17Nov 7, 2024Updated last year
- Generation scripts for EARS-WHAM and EARS-Reverb☆43Jul 4, 2025Updated 8 months ago
- Dataset simulation for DPCCN.☆16Dec 25, 2022Updated 3 years ago
- Llasa Speed Up☆61Jan 18, 2026Updated 2 months ago
- The source code of Tim-TSENet☆15Apr 22, 2022Updated 3 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement☆10Jan 24, 2022Updated 4 years ago
- Audio-FLAN☆160Sep 23, 2025Updated 6 months ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆16May 19, 2023Updated 2 years ago
- ☆25Jan 24, 2023Updated 3 years ago
- ☆15Jun 15, 2022Updated 3 years ago
- Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"☆44Jul 10, 2024Updated last year
- ☆52Sep 10, 2024Updated last year
- TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages☆18May 23, 2024Updated last year
- StammerClipper:: :A deep learning approach for automatic stutter detection☆12Mar 27, 2022Updated 3 years ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 9 months ago
- ☆15Nov 11, 2024Updated last year
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆57Apr 14, 2025Updated 11 months ago
- This is the official repository of ``Scalable Neural Vocoder from Range-Null Space Decomposition'', which is submitted to TPAMI.☆47Oct 11, 2025Updated 5 months ago
- Production first, nn-based on-device signal processing toolkit.☆65May 30, 2023Updated 2 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- ☆67Aug 16, 2023Updated 2 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆26Feb 25, 2026Updated 3 weeks ago
- ☆15Jul 11, 2022Updated 3 years ago
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.☆33Nov 9, 2025Updated 4 months ago
- ☆64Jun 28, 2023Updated 2 years ago
- ☆11Oct 14, 2023Updated 2 years ago
- Personalized AEC☆19Nov 3, 2022Updated 3 years ago
- Exploring Binary Classification Loss for Speaker Verification☆18Jul 18, 2023Updated 2 years ago
- Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation☆24Nov 4, 2025Updated 4 months ago