[ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model
☆56Oct 12, 2025Updated 7 months ago
Alternatives and similar repositories for alitok
Users that are interested in alitok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2025] ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models☆33Jul 1, 2025Updated 10 months ago
- ☆32Mar 4, 2025Updated last year
- ☆15Apr 16, 2026Updated last month
- ☆31Jul 16, 2025Updated 10 months ago
- ☆34Apr 8, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆93Apr 10, 2025Updated last year
- FACM: Flow-Anchored Consistency Models☆146Aug 6, 2025Updated 9 months ago
- Official implementation of the paper: "NeoBabel: A Multilingual Open Tower for Visual Generation"☆23Aug 4, 2025Updated 9 months ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 11 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆24Jun 18, 2025Updated 11 months ago
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆47Sep 2, 2025Updated 8 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆43Mar 11, 2025Updated last year
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…☆158Jul 24, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated 11 months ago
- A neural speech codec based on discrete WavLM representations☆26Aug 28, 2024Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆14Mar 11, 2025Updated last year
- Variable Bitrate Residual Vector Quantization for Audio Coding☆52May 1, 2025Updated last year
- ☆101Jan 19, 2026Updated 4 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆285Dec 4, 2024Updated last year
- ☆53Jun 13, 2025Updated 11 months ago
- ☆14Feb 3, 2026Updated 3 months ago
- ☆25Jan 24, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- This repo contains the code for 1D tokenizer and generator☆1,150Mar 20, 2025Updated last year
- ☆11Feb 20, 2025Updated last year
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆317Jun 2, 2025Updated 11 months ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆37Mar 3, 2026Updated 2 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆204Jan 7, 2026Updated 4 months ago
- Code release of our IROS 2024 paper "MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction"☆19Nov 5, 2024Updated last year
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆92Dec 20, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- faster inference☆28Jan 20, 2025Updated last year
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23May 19, 2026Updated last week
- ☆12Mar 11, 2025Updated last year
- Where is the "main theme" in an orchestral score?☆16Updated this week
- Code for "Self-Sustaining Representation Expansion for Non-Exemplar Class-Incremental Learning"☆20Oct 26, 2022Updated 3 years ago
- Official Implementation for Diffusion Models Without Classifier-free Guidance☆174Feb 18, 2025Updated last year
- The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation☆26Aug 17, 2025Updated 9 months ago