LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
☆46Mar 10, 2025Updated 11 months ago
Alternatives and similar repositories for LLaSE-G1
Users that are interested in LLaSE-G1 are comparing it to the libraries listed below
Sorting:
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆17Mar 3, 2025Updated 11 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆98Apr 1, 2025Updated 11 months ago
- Llasa Speed Up☆60Jan 18, 2026Updated last month
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement☆16Jul 11, 2025Updated 7 months ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆56Apr 14, 2025Updated 10 months ago
- The official implementation of the DIFFA series for dLLM-based large audio language model☆59Feb 2, 2026Updated last month
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆38Aug 7, 2024Updated last year
- OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.☆482Nov 23, 2025Updated 3 months ago
- ☆15Apr 2, 2025Updated 11 months ago
- ☆14Jul 11, 2022Updated 3 years ago
- A lightweight audio codec based on a single quantizer☆32Sep 4, 2025Updated 5 months ago
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆22Jun 18, 2025Updated 8 months ago
- ☆39Sep 25, 2025Updated 5 months ago
- The official source code of UniAudio☆95Mar 29, 2024Updated last year
- [CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…☆23Jun 6, 2025Updated 8 months ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- ☆21Dec 19, 2023Updated 2 years ago
- Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)☆89Jan 31, 2026Updated last month
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆36Dec 24, 2025Updated 2 months ago
- ☆19Mar 22, 2024Updated last year
- A Massive Contextual Speech Recognition Benchmark.☆99Aug 6, 2025Updated 6 months ago
- Implementation of SpatialCodec.☆69Sep 23, 2023Updated 2 years ago
- This is the official implementation of the LiSenNet☆149Nov 15, 2024Updated last year
- ☆25May 14, 2020Updated 5 years ago
- Official code of SenSE.☆74Oct 30, 2025Updated 4 months ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Mar 17, 2023Updated 2 years ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆93Jul 4, 2024Updated last year
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆43Mar 3, 2025Updated 11 months ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling☆165Feb 28, 2025Updated last year
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- The official repo of UL-UNAS, an ultra-lightweight SE model.☆123Feb 13, 2026Updated 2 weeks ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆78Nov 1, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- ☆11Nov 7, 2024Updated last year
- ☆10Apr 17, 2024Updated last year
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year