AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models
☆25Sep 26, 2023Updated 2 years ago
Alternatives and similar repositories for AudioCodec-Hub
Users who are interested in AudioCodec-Hub are comparing it to the libraries listed below.
- Super Flappy Bird in p5.js☆10Mar 8, 2021Updated 5 years ago
- Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus☆13Oct 15, 2022Updated 3 years ago
- Code for the Findings of NAACL 2022 (Long Paper): AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NL…☆18May 4, 2022Updated 3 years ago
- Taiwanese Speech Synthesis with Tacotron2☆25Oct 2, 2022Updated 3 years ago
- **Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…☆101Apr 10, 2025Updated 11 months ago
- A publishing website of a table collecting meta-learning-related papers in the area of human language processing.☆17Aug 2, 2021Updated 4 years ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- 《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm☆81Oct 19, 2023Updated 2 years ago
- ☆31Jul 13, 2023Updated 2 years ago
- ☆12Feb 3, 2026Updated last month
- ☆15Sep 9, 2021Updated 4 years ago
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆56Jun 1, 2025Updated 9 months ago
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆99Oct 14, 2022Updated 3 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- ☆51Mar 5, 2026Updated 2 weeks ago
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆19Nov 29, 2022Updated 3 years ago
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆54Jan 18, 2024Updated 2 years ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆77Jun 9, 2023Updated 2 years ago
- This repository surveys papers focusing on prompting and adapters for speech processing.☆111Aug 4, 2023Updated 2 years ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆125Mar 20, 2025Updated last year
- A Neural Audio Codec (NAC) for Universal Audio☆44May 30, 2025Updated 9 months ago
- ☆10Sep 19, 2022Updated 3 years ago
- A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation☆156Nov 30, 2025Updated 3 months ago
- Audio Codec Speech processing Universal PERformance Benchmark☆299Jan 8, 2026Updated 2 months ago
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆56Nov 16, 2025Updated 4 months ago
- Unsupervised spoken sentence embeddings☆14Dec 14, 2022Updated 3 years ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆20Jan 3, 2023Updated 3 years ago
- The demo page for ALMTokenizer☆59Apr 14, 2025Updated 11 months ago
- ☆21Jun 1, 2021Updated 4 years ago
- ☆46Jul 7, 2025Updated 8 months ago
- 🦅🔗 Building FlyteGPT on Flyte with LangChain☆30Jan 23, 2024Updated 2 years ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆89Dec 20, 2024Updated last year
- ☆13Sep 25, 2024Updated last year
- A toolkit dedicated to speech evaluation.☆23Sep 26, 2024Updated last year
- ☆12Mar 11, 2025Updated last year
- A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)☆58Apr 17, 2024Updated last year
- Layer-wise analysis of self-supervised pre-trained speech representations☆127Oct 18, 2024Updated last year
- [ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and s…☆153May 30, 2025Updated 9 months ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆213Sep 19, 2024Updated last year