Kevin-naticl/LLaSE-G1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Kevin-naticl/LLaSE-G1)

Kevin-naticl / LLaSE-G1

LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement

☆105

Alternatives and similar repositories for LLaSE-G1

Users that are interested in LLaSE-G1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Kevin-naticl / LLaSE
View on GitHub
LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement
☆16Jul 11, 2025Updated last year
ASLP-lab / LLaSE-G1
View on GitHub
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
☆47Mar 10, 2025Updated last year
Beilong-Tang / TSELM
View on GitHub
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆60Apr 14, 2025Updated last year
ssi-research / FQSE
View on GitHub
Fully Quantized Neural Networks For Speech Enhancement
☆65Feb 15, 2024Updated 2 years ago
ZehuaKcrissLi / GTR-Voice
View on GitHub
☆16Nov 11, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
ASLP-lab / SenSE
View on GitHub
Official code of SenSE.
☆90Oct 30, 2025Updated 8 months ago
xiaomi-research / dasheng-denoiser
View on GitHub
Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…
☆81Jun 16, 2025Updated last year
zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
kaistmm / AdaptVC
View on GitHub
☆17Jun 2, 2025Updated last year
Emrys365 / se-scaling
View on GitHub
Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…
☆41Aug 7, 2024Updated last year
JusperLee / TFACM
View on GitHub
☆23Jul 16, 2025Updated last year
yaoxunji / gen-se
View on GitHub
GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling
☆175Feb 28, 2025Updated last year
urgent-challenge / urgent2024_challenge
View on GitHub
Official data preparation scripts for the URGENT 2024 Challenge
☆90May 21, 2025Updated last year
Beilong-Tang / lauraTSE_code
View on GitHub
Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.
☆37Nov 9, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zhenye234 / X-Codec-2.0
View on GitHub
Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
☆360Jun 25, 2026Updated 3 weeks ago
nanless / universal-speech-enhancement
View on GitHub
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…
☆82Jul 29, 2024Updated last year
seongq / flowmse
View on GitHub
(ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement
☆106Jul 23, 2025Updated 11 months ago
juhayna-zh / BSRNN-speech-preprocess
View on GitHub
A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.
☆15Aug 22, 2023Updated 2 years ago
ASLP-lab / OSUM
View on GitHub
OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.
☆494Nov 23, 2025Updated 7 months ago
Ryuk17 / noise-xorcist
View on GitHub
Single Channel Speech Enhancement Methods and Toolbox
☆55Jun 28, 2026Updated 3 weeks ago
anton-kashkin / hifi_vc
View on GitHub
☆25Jan 24, 2023Updated 3 years ago
felixfuyihui / Optimize-FixBF-Weight
View on GitHub
☆17Jun 3, 2020Updated 6 years ago
ozspeech / OZSpeech
View on GitHub
[ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching
☆45Feb 9, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lucadellalib / discrete-wavlm-codec
View on GitHub
A neural speech codec based on discrete WavLM representations
☆26Aug 28, 2024Updated last year
Tencent / StableToken
View on GitHub
[ICLR 2026] StableToken: A state-of-the-art noise-robust semantic speech tokenizer featuring Voting-LFQ for resilient SpeechLLMs.
☆33Feb 27, 2026Updated 4 months ago
lmxue / Audio-FLAN
View on GitHub
Audio-FLAN
☆161Sep 23, 2025Updated 9 months ago
walker-hyf / NCSSD
View on GitHub
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆61Nov 1, 2024Updated last year
wenet-e2e / wesep
View on GitHub
Target Speaker Extraction Toolkit
☆299Oct 4, 2025Updated 9 months ago
RoyChao19477 / SEMamba
View on GitHub
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
☆272Dec 12, 2025Updated 7 months ago
urgent-challenge / urgent2025_challenge
View on GitHub
Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.
☆85May 21, 2025Updated last year
zelokuo / VPIDM
View on GitHub
This is official repository of new SOTA diffusion models based method for speech enhancement
☆43Jul 31, 2024Updated last year
xinshengwang / robpitch
View on GitHub
A pitch detection model trained to be robust against noise and reverberation environments.
☆27Jan 21, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
cszheng-ioa / Sixty-years-of-frequency-domain-monaural-speech-enhancement
View on GitHub
☆161Jan 30, 2024Updated 2 years ago
cisco-open / pase
View on GitHub
PASE: Phonologically Anchored Speech Enhancer
☆84Updated this week
luotianze666 / WaveFM
View on GitHub
[NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching
☆133Apr 8, 2026Updated 3 months ago
xi-j / Mamba-TasNet
View on GitHub
☆116Oct 1, 2024Updated last year
AmphionTeam / Emilia-NV
View on GitHub
Official Repository of Paper: "Emilia-NV: A Non-Verbal Speech Dataset with Word-Level Annotation for Human-Like Speech Modeling"
☆91Sep 18, 2025Updated 10 months ago
jiaqili3 / DualCodec
View on GitHub
[Interspeech 2025] DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec
☆72Mar 11, 2026Updated 4 months ago
lucadellalib / audiocodecs
View on GitHub
A collections of audio codecs with a standardized API
☆43Apr 15, 2026Updated 3 months ago