NKU-HLT / AudioEditorView external linksLinks
☆40Apr 2, 2025Updated 10 months ago
Alternatives and similar repositories for AudioEditor
Users that are interested in AudioEditor are comparing it to the libraries listed below
Sorting:
- Retrieval-Augmented MOS Prediction with Prior Knowledge Integration☆32Mar 23, 2025Updated 10 months ago
- ☆43Jan 13, 2025Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21May 26, 2025Updated 8 months ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆23Aug 14, 2025Updated 5 months ago
- Official implementation for FlowSep☆69Jan 2, 2025Updated last year
- A neural speech codec based on discrete WavLM representations☆24Aug 28, 2024Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 6 months ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆45Updated this week
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆74Aug 24, 2024Updated last year
- ☆14Feb 19, 2025Updated 11 months ago
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation☆12Nov 28, 2024Updated last year
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 10 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- ☆21Jul 15, 2024Updated last year
- ☆49Apr 1, 2025Updated 10 months ago
- ☆11Nov 7, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- ☆44Sep 19, 2024Updated last year
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆86Dec 20, 2024Updated last year
- Paper List☆18Jul 2, 2025Updated 7 months ago
- ☆19Mar 22, 2024Updated last year
- ☆20Sep 2, 2024Updated last year
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆35Oct 23, 2025Updated 3 months ago
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆56Jun 1, 2025Updated 8 months ago
- ☆13Oct 11, 2024Updated last year
- ☆15Nov 11, 2024Updated last year
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆55Aug 15, 2025Updated 5 months ago
- The open source code for LLM-Codec☆145Aug 18, 2024Updated last year
- ☆52Jul 16, 2025Updated 6 months ago
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- Whisper Speech Quality Assessment (WhiSQA)☆16Oct 14, 2025Updated 3 months ago
- ☆155Nov 22, 2024Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- ☆37Jul 4, 2024Updated last year
- Query-conditioned target sound extraction model☆30Mar 25, 2025Updated 10 months ago