jishengpeng / WavTokenizerView external linksLinks
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
☆1,266Mar 2, 2025Updated 11 months ago
Alternatives and similar repositories for WavTokenizer
Users that are interested in WavTokenizer are comparing it to the libraries listed below
Sorting:
- [ACL 2025 Oral] Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models☆210Jun 25, 2025Updated 7 months ago
- AcademiCodec: An Open Source Audio Codec Model for Academic Research☆667Dec 27, 2023Updated 2 years ago
- [ACL 2025 Main] ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec☆274Nov 22, 2024Updated last year
- kight is a static analysis tool for c/c++ programs.☆214Dec 27, 2024Updated last year
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…☆646Jun 9, 2024Updated last year
- An Workspace for HMI tools☆164Jul 11, 2024Updated last year
- PyTorch Implementation of StyleSinger(AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis☆416Aug 15, 2025Updated 6 months ago
- Evaluation of Text-to-Video Generation Models: A Dynamics Perspective[NeurIPS 2024].☆274Dec 3, 2024Updated last year
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆155Oct 18, 2024Updated last year
- Advanced Unsupervised Image Enhancement with GAN☆247Nov 11, 2024Updated last year
- ☆247Nov 24, 2024Updated last year
- PyTorch Implementation of TCSinger(EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control☆371Oct 7, 2025Updated 4 months ago
- [ICASSP 2024] TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models☆183Nov 22, 2024Updated last year
- ☆176Feb 21, 2025Updated 11 months ago
- Large-Scale Selfie Video Dataset (L-SVD): A Benchmark for Emotion Recognition☆306Aug 18, 2024Updated last year
- Official repo for WavCraft, an AI agent for audio creation and editing☆524Feb 15, 2025Updated last year
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆212Sep 19, 2024Updated last year
- Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…☆156May 25, 2024Updated last year
- 莫甘娜问卷表单编辑器,低代码快速搭建表单,AI表单生成,表单数据搜集统计☆147Aug 9, 2024Updated last year
- YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardwares (e.g., GPU) to efficiently support the exec…☆254Jan 7, 2026Updated last month
- It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionali…☆154Dec 19, 2024Updated last year
- Deep Reinforcement Learning Algorithms for solving Atari 2600 Games☆143Mar 23, 2023Updated 2 years ago
- A python package that integrate algorithms and various machine learning approaches to extract features (genes) effective for classificati…☆252Jan 15, 2026Updated last month
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…☆318Jul 31, 2025Updated 6 months ago
- ☆288Jul 6, 2024Updated last year
- ☆251Feb 11, 2025Updated last year
- A curated list of awesome papers related to adversarial attacks and defenses for information retrieval. If I missed any papers, feel free…☆221Jul 11, 2024Updated last year
- ☆142May 8, 2024Updated last year
- Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"☆242May 24, 2024Updated last year
- Visualization, simulation, manipulation of Intrinsically disorder proteins with Gibbs sampling☆288Oct 24, 2024Updated last year
- ☆135Sep 24, 2024Updated last year
- ☆242Jul 5, 2024Updated last year
- ☆297Sep 14, 2025Updated 5 months ago
- A code repository designed to show the best GitHub has to offer.☆165Jun 30, 2024Updated last year
- ☆121Sep 30, 2024Updated last year
- Build a simple yet effective CNN to work as a sketch recognizer. Just like Google Quick-Draw Project.☆143Mar 23, 2023Updated 2 years ago
- Analysis and visualization of multi-omics data. In ongoing development: multi-modal fusion, sparse learning, and spatio-temporal effects.…☆206Jan 15, 2026Updated last month
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆349Jul 21, 2025Updated 6 months ago
- Simple yet powerful Twitter data retrieval SDK with multi-language support.No Limits, No Auth Required☆183Jan 6, 2025Updated last year