[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
☆1,280 · Mar 2, 2025 · Updated last year
Alternatives and similar repositories for WavTokenizer
Users that are interested in WavTokenizer are comparing it to the libraries listed below.
- [ACL 2025 Oral] Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models ☆210 · Jun 25, 2025 · Updated 9 months ago
- [ACL 2025 Main] ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec ☆275 · Nov 22, 2024 · Updated last year
- AcademiCodec: An Open Source Audio Codec Model for Academic Research ☆671 · Dec 27, 2023 · Updated 2 years ago
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a… ☆650 · Jun 9, 2024 · Updated last year
- [ICASSP 2024] TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models ☆183 · Nov 22, 2024 · Updated last year
- A Workspace for HMI tools ☆164 · Jul 11, 2024 · Updated last year
- kight is a static analysis tool for C/C++ programs. ☆214 · Dec 27, 2024 · Updated last year
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec" ☆213 · Sep 19, 2024 · Updated last year
- Evaluation of Text-to-Video Generation Models: A Dynamics Perspective [NeurIPS 2024]. ☆274 · Dec 3, 2024 · Updated last year
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation ☆155 · Oct 18, 2024 · Updated last year
- PyTorch Implementation of StyleSinger (AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis ☆418 · Aug 15, 2025 · Updated 7 months ago
- ☆247 · Nov 24, 2024 · Updated last year
- Advanced Unsupervised Image Enhancement with GAN ☆247 · Nov 11, 2024 · Updated last year
- ☆176 · Feb 21, 2025 · Updated last year
- PyTorch Implementation of TCSinger (EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control ☆375 · Oct 7, 2025 · Updated 5 months ago
- Large-Scale Selfie Video Dataset (L-SVD): A Benchmark for Emotion Recognition ☆306 · Aug 18, 2024 · Updated last year
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ☆349 · Jul 21, 2025 · Updated 8 months ago
- State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio. ☆1,745 · Jan 26, 2026 · Updated 2 months ago
- Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx… ☆156 · May 25, 2024 · Updated last year
- Official repo for WavCraft, an AI agent for audio creation and editing ☆526 · Feb 15, 2025 · Updated last year
- It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionali… ☆155 · Dec 19, 2024 · Updated last year
- Morgana survey form editor: low-code rapid form building, AI form generation, and form data collection and statistics ☆147 · Aug 9, 2024 · Updated last year
- A Survey of Spoken Dialogue Models (60 pages) ☆316 · Nov 28, 2024 · Updated last year
- Deep Reinforcement Learning Algorithms for solving Atari 2600 Games ☆143 · Mar 23, 2023 · Updated 3 years ago
- ☆142 · May 8, 2024 · Updated last year
- AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model ☆295 · Oct 12, 2025 · Updated 5 months ago
- ☆121 · Sep 30, 2024 · Updated last year
- A code repository designed to show the best GitHub has to offer. ☆165 · Jun 30, 2024 · Updated last year
- ☆297 · Sep 14, 2025 · Updated 6 months ago
- YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardware (e.g., GPU) to efficiently support the exec… ☆254 · Jan 7, 2026 · Updated 2 months ago
- ☆252 · Feb 11, 2025 · Updated last year
- Simple yet powerful Twitter data retrieval SDK with multi-language support. No Limits, No Auth Required ☆183 · Jan 6, 2025 · Updated last year
- Visualization, simulation, and manipulation of intrinsically disordered proteins with Gibbs sampling ☆288 · Oct 24, 2024 · Updated last year
- User Identity Scaffolding for Multiple OIDC Authentications for User ☆95 · Jun 14, 2025 · Updated 9 months ago
- An awesome list of self-sovereign identity resources. ☆138 · Jul 9, 2024 · Updated last year
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098… ☆316 · Jul 31, 2025 · Updated 7 months ago
- ☆135 · Sep 24, 2024 · Updated last year
- A Python package that integrates algorithms and various machine learning approaches to extract features (genes) effective for classificati… ☆252 · Jan 15, 2026 · Updated 2 months ago
- Completed this competition in collaboration with Jiang Yan (https://github.com/jy1993) and Guan Shuicheng (https://github.com/guanshuicheng… ☆363 · Nov 6, 2024 · Updated last year