imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video.
☆40Jun 22, 2024Updated last year
Alternatives and similar repositories for ImageTokenizer
Users that are interested in ImageTokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆39Jun 20, 2024Updated last year
- Rust standalone inference of Namo-500M series models. Extremly tiny, runing VLM on CPU.☆24Mar 12, 2025Updated last year
- ☆16Apr 28, 2023Updated 3 years ago
- [NeurIPS 2024] Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to im…☆117May 30, 2024Updated last year
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Sep 9, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The extented code of layered conceptual image compression. Journal submitted.☆15Aug 29, 2022Updated 3 years ago
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆39Mar 25, 2024Updated 2 years ago
- [ICME 2020, Oral] Fine-Grained Expression Manipulation via Structured Latent Space☆14Nov 16, 2020Updated 5 years ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆1,008Nov 25, 2025Updated 6 months ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆46Nov 17, 2024Updated last year
- The unofficial CLI of Amazon S3 Vectors (Preview) in Rust☆17Jul 19, 2025Updated 10 months ago
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆27Nov 7, 2023Updated 2 years ago
- [EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati…☆99Jun 23, 2024Updated last year
- Simple MoE - Day 17 of 365 Days of Repos☆19Apr 21, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/…☆34Mar 4, 2025Updated last year
- ☆11Jun 11, 2025Updated 11 months ago
- OpenKit Server☆114Jun 9, 2014Updated 11 years ago
- iconsax for flutter package☆34Oct 7, 2023Updated 2 years ago
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- A Python library for controlling AlphaDog robotic dogs.☆12Apr 16, 2026Updated last month
- Official implementation for "Revisiting Discriminative vs. Generative Classifiers: Theory and Implications".☆14Feb 7, 2023Updated 3 years ago
- A Conversational Speech Generation Model☆14Mar 16, 2025Updated last year
- Implementation of ICASSP2020 paper "Variable Bitrate Image Compression with Quality Scaling Factors"☆24Jul 19, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆18Sep 26, 2023Updated 2 years ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆20Oct 17, 2024Updated last year
- ☆30Jun 10, 2025Updated 11 months ago
- This program is used for solving Poisson Equation with several methods. And each methods are parallelized with openMP, MPI and GPU☆12Oct 25, 2017Updated 8 years ago
- A simple web server written in Lua☆16Sep 24, 2022Updated 3 years ago
- hllama is a library which aims to provide a set of utility tools for large language models.☆10Apr 16, 2024Updated 2 years ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆21Nov 19, 2024Updated last year
- [ICCV 2023] ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules