OpenMOSS / MOSS-Audio-Tokenizer
View external linksLinks

MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA reconstruction and strong performance in generation and understanding—serving as a unified interface for next-generation native audio language models.
85Updated this week

Alternatives and similar repositories for MOSS-Audio-Tokenizer

Users that are interested in MOSS-Audio-Tokenizer are comparing it to the libraries listed below

Sorting:

Are these results useful?