imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video.
☆40Jun 22, 2024Updated last year
Alternatives and similar repositories for ImageTokenizer
Users that are interested in ImageTokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆39Jun 20, 2024Updated last year
- MMPD Dataset from ECCV'2024 "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"☆21Jul 15, 2024Updated last year
- [NeurIPS 2024] Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to im…☆117May 30, 2024Updated last year
- The extented code of layered conceptual image compression. Journal submitted.☆15Aug 29, 2022Updated 3 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Apr 24, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- huggingface에 있는 한국어 데이터 세트☆36Oct 10, 2024Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers☆1,003Nov 25, 2025Updated 5 months ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆46Nov 17, 2024Updated last year
- I-SHEEP: Iterative Self-enHancEmEnt Paradigm of LLMs through Self-Instruct and Self-Assessment☆17Jan 16, 2025Updated last year
- The unofficial CLI of Amazon S3 Vectors (Preview) in Rust☆17Jul 19, 2025Updated 9 months ago
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆27Nov 7, 2023Updated 2 years ago
- ☆27Dec 16, 2024Updated last year
- [EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati…☆99Jun 23, 2024Updated last year
- Simple MoE - Day 17 of 365 Days of Repos☆19Apr 21, 2026Updated 2 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/…☆34Mar 4, 2025Updated last year
- [ICCV 2025 Highlight] official code of paper "DLF: Extreme Image Compression with Dual-generative Latent Fusion"☆45Dec 24, 2025Updated 4 months ago
- ☆11Jun 11, 2025Updated 10 months ago
- OpenKit Server☆114Jun 9, 2014Updated 11 years ago
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆182Jun 20, 2024Updated last year
- A Python library for controlling AlphaDog robotic dogs.☆12Apr 16, 2026Updated 2 weeks ago
- A public repository of "Generative AI Meets 6G and Beyond: Diffusion Models for Semantic Communications", which is a collection of educat…☆39Apr 11, 2026Updated 3 weeks ago
- Official implementation for "Revisiting Discriminative vs. Generative Classifiers: Theory and Implications".☆14Feb 7, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of ICASSP2020 paper "Variable Bitrate Image Compression with Quality Scaling Factors"☆24Jul 19, 2023Updated 2 years ago
- ☆28Jun 10, 2025Updated 10 months ago
- A simple web server written in Lua☆16Sep 24, 2022Updated 3 years ago
- Minimal version of LSeg, based on https://github.com/isl-org/lang-seg☆26Jan 30, 2023Updated 3 years ago
- [ICCV 2023] ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules☆28Jun 3, 2024Updated last year
- [Suspended] Modern, customizable AI character frontend for enthusiasts (inspired by SillyTavern)☆10Nov 8, 2024Updated last year
- A Dead Simple and Modularized Multi-Modal Training and Finetune Framework. Compatible to any LLaVA/Flamingo/QwenVL/MiniGemini etc series …☆19Apr 24, 2024Updated 2 years ago
- Simplify Google Gemini 1.5 Pro's authentication☆15Apr 11, 2024Updated 2 years ago
- CloudFlare worker as a proxy to Google Generative Language API.☆19Oct 16, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This repo contains the code for 1D tokenizer and generator☆1,145Mar 20, 2025Updated last year
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆17Feb 15, 2025Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,948Aug 15, 2024Updated last year
- GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)☆74Apr 10, 2026Updated 3 weeks ago
- ☆28Sep 8, 2025Updated 7 months ago
- wasm bindings for huggingface tokenizers library☆34Jun 30, 2022Updated 3 years ago
- Official repository of FlowInOne: Unifying Multimodal Generation as Image-In Image-Out Flow Matching☆51Apr 25, 2026Updated last week