Kevin-naticl / LLaSELinks
LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement
☆16Updated 2 months ago
Alternatives and similar repositories for LLaSE
Users that are interested in LLaSE are comparing it to the libraries listed below
Sorting:
- Official code of SenSE.☆18Updated this week
- A neural speech codec based on discrete WavLM representations☆24Updated last year
- ☆18Updated last year
- ☆11Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 8 months ago
- Spherical residual vector quantization (SRVQ)☆30Updated last year
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆48Updated 5 months ago
- ☆33Updated last month
- ☆12Updated last year
- ☆49Updated 6 months ago
- A toolkit dedicate for speech evaluation.☆23Updated last year
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated 2 years ago
- Whisper Speech Quality Assessment (WhiSQA)☆15Updated 10 months ago
- PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or…☆16Updated 2 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Updated 2 years ago
- Streaming Vocos☆29Updated 3 months ago
- ☆13Updated 6 months ago
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Updated 2 years ago
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆35Updated last year
- ☆22Updated 2 years ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated last year
- Generation scripts for EARS-WHAM and EARS-Reverb☆37Updated 3 months ago
- Exploring Binary Classification Loss for Speaker Verification☆18Updated 2 years ago
- ☆13Updated last month
- ☆54Updated 2 years ago
- A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features☆20Updated last month
- Ultra-low bitrate speech codec (0.27-1 kbps) with cross-modal alignment and real-time capabilities☆65Updated last month
- offical code for Dense-TSNet☆12Updated last year
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Updated last year
- Lightweight Speech Representation Learning for One-Shot Voice Conversion☆24Updated 9 months ago