lucasjinreal/ImageTokenizer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucasjinreal/ImageTokenizer)

lucasjinreal / ImageTokenizer

imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video.

☆40

Alternatives and similar repositories for ImageTokenizer

Users that are interested in ImageTokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucasjinreal / LLaVA-Magvit2
View on GitHub
LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.
☆38Jun 20, 2024Updated 2 years ago
jin-s13 / MMPD-Dataset
View on GitHub
MMPD Dataset from ECCV'2024 "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"
☆22Jul 15, 2024Updated 2 years ago
ByungKwanLee / Meteor
View on GitHub
[NeurIPS 2024] Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to im…
☆116May 30, 2024Updated 2 years ago
haotian-liu / transformers_llava
View on GitHub
☆16Apr 28, 2023Updated 3 years ago
MonolithFoundation / Bumblebee
View on GitHub
A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.
☆38Sep 9, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
FloatButterfly / LCIC_plus
View on GitHub
The extented code of layered conceptual image compression. Journal submitted.
☆15Aug 29, 2022Updated 3 years ago
tile-ai / tvm
View on GitHub
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆20Updated this week
Pepper-lll / LMforImageGeneration
View on GitHub
Codebase for the paper-Elucidating the design space of language models for image generation
☆45Nov 17, 2024Updated last year
songys / huggingface_KoreanDataset
View on GitHub
huggingface에 있는 한국어 데이터 세트
☆37Oct 10, 2024Updated last year
junshutang / EGGAN
View on GitHub
[ICME 2020, Oral] Fine-Grained Expression Manipulation via Structured Latent Space
☆14Nov 16, 2020Updated 5 years ago
TencentARC / SEED-Voken
View on GitHub
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,018Nov 25, 2025Updated 8 months ago
ipc-lab / deepJSCC-feedback
View on GitHub
Joint Source-Channel Coding of Images With Feedback
☆15Apr 21, 2020Updated 6 years ago
ByungKwanLee / TroL
View on GitHub
[EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati…
☆99Jun 23, 2024Updated 2 years ago
lucasjinreal / Namo-R1
View on GitHub
A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.
☆256Apr 22, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
nengelmann / Fuyu-8B---Exploration
View on GitHub
Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍
☆27Nov 7, 2023Updated 2 years ago
00make / robodog
View on GitHub
A Python library for controlling AlphaDog robotic dogs.
☆12Apr 16, 2026Updated 3 months ago
lucidrains / titok-pytorch
View on GitHub
Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
☆184Jun 20, 2024Updated 2 years ago
feifeibear / ChituAttention
View on GitHub
Quantized Attention on GPU
☆45Nov 22, 2024Updated last year
souadELmaazouzi / OpenDCVCs
View on GitHub
☆17Nov 6, 2025Updated 8 months ago
LaVi-Lab / Visual-Table
View on GitHub
[EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"
☆20Oct 17, 2024Updated last year
NVlabs / ShotBench
View on GitHub
☆29Dec 16, 2024Updated last year
hzlsaber / FGTS
View on GitHub
📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"
☆25Dec 2, 2025Updated 7 months ago
zhiqic / ChartReader
View on GitHub
[ICCV 2023] ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules
☆28Jun 3, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
tongxyh / ImageCompression_VariableRate
View on GitHub
Implementation of ICASSP2020 paper "Variable Bitrate Image Compression with Quality Scaling Factors"
☆24Jul 19, 2023Updated 3 years ago
deep-diver / hllama
View on GitHub
hllama is a library which aims to provide a set of utility tools for large language models.
☆10Apr 16, 2024Updated 2 years ago
Longin-Yu / ComRoPE
View on GitHub
☆11Jun 11, 2025Updated last year
krrish94 / lseg-minimal
View on GitHub
Minimal version of LSeg, based on https://github.com/isl-org/lang-seg
☆26Jan 30, 2023Updated 3 years ago
davidkim205 / translation
View on GitHub
☆13Apr 17, 2024Updated 2 years ago
ShinChven / vertex-ai-proxy
View on GitHub
Simplify Google Gemini 1.5 Pro's authentication
☆15Apr 11, 2024Updated 2 years ago
om-ai-lab / GroundVLP
View on GitHub
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)
☆74Apr 10, 2026Updated 3 months ago
Phylliida / MambaLens
View on GitHub
Mamba support for transformer lens
☆20Sep 17, 2024Updated last year
chinmaysahu / UnderwaterChannelModeling
View on GitHub
Underwater channels are modeled and equalizers are designed to preserve the message bits from distortion. LMS, Levinsondurbin, Neural Net…
☆17May 6, 2019Updated 7 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
paolo-favaro / rebuttal-template
View on GitHub
Official ECCV 2026 Rebuttal Template
☆20Apr 24, 2026Updated 3 months ago
nyunAI / PruneGPT
View on GitHub
☆51May 31, 2024Updated 2 years ago
Shengcao-Cao / groundLMM
View on GitHub
Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision
☆47Oct 19, 2025Updated 9 months ago
mithril-security / tokenizers-wasm
View on GitHub
wasm bindings for huggingface tokenizers library
☆34Jun 30, 2022Updated 4 years ago
realsigridjin / s3-vectors-rs
View on GitHub
The unofficial CLI of Amazon S3 Vectors (Preview) in Rust
☆17Jul 19, 2025Updated last year
Beomi / Gemma-EasyLM
View on GitHub
Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)
☆50Mar 2, 2024Updated 2 years ago
JiauZhang / prompt-to-prompt
View on GitHub
Implementation of Prompt-to-Prompt Image Editing with Cross Attention Control
☆16Apr 5, 2023Updated 3 years ago