[NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.
☆323Jul 9, 2024Updated last year
Alternatives and similar repositories for OmniTokenizer
Users that are interested in OmniTokenizer are comparing it to the libraries listed below
- SEED-Voken: A Series of Powerful Visual Tokenizers☆998Nov 25, 2025Updated 3 months ago
- ☆21Jan 17, 2025Updated last year
- a family of versatile and state-of-the-art video tokenizers.☆438Sep 1, 2025Updated 6 months ago
- This repo contains the code for 1D tokenizer and generator☆1,129Mar 20, 2025Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,941Aug 15, 2024Updated last year
- Official PyTorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆100Feb 11, 2025Updated last year
- [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization☆584Jun 7, 2024Updated last year
- High-performance Image Tokenizers for VAR and AR☆303Apr 25, 2025Updated 10 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆286Dec 4, 2024Updated last year
- This project features optimized Go language, expert source code, concurrent processing, and industry-best practices.☆142Mar 14, 2023Updated 3 years ago
- ☆142May 8, 2024Updated last year
- [ICLR 2025] Binary Spherical Quantization + [CVPR 2026] Leech Spherical Quantization☆203Dec 18, 2025Updated 3 months ago
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content☆605Oct 6, 2024Updated last year
- ☆287Jul 6, 2024Updated last year
- ☆135Sep 24, 2024Updated last year
- kight is a static analysis tool for C/C++ programs.☆214Dec 27, 2024Updated last year
- C++ codes for FDTD Maxwell's equation.☆161Jun 11, 2023Updated 2 years ago
- Build a simple yet effective CNN to work as a sketch recognizer. Just like Google Quick-Draw Project.☆143Mar 23, 2023Updated 3 years ago
- Harnessing the Power of AI to Navigate the Information Age – Uncovering Truth, Promoting Transparency, and Championing Fact-Based Discour…☆147Jun 2, 2023Updated 2 years ago
- linkedin, seek job information crawler☆106Apr 19, 2025Updated 11 months ago
- YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardwares (e.g., GPU) to efficiently support the exec…☆254Jan 7, 2026Updated 2 months ago
- Advanced Unsupervised Image Enhancement with GAN☆247Nov 11, 2024Updated last year
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆155Oct 18, 2024Updated last year
- ☆88Aug 26, 2025Updated 6 months ago
- Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024)☆258Mar 13, 2026Updated last week
- ☆247Nov 24, 2024Updated last year
- A workspace for HMI tools☆164Jul 11, 2024Updated last year
- ☆143May 25, 2024Updated last year
- ☆71Sep 2, 2023Updated 2 years ago
- Visualization, simulation, and manipulation of intrinsically disordered proteins with Gibbs sampling☆288Oct 24, 2024Updated last year
- Evaluation of Text-to-Video Generation Models: A Dynamics Perspective [NeurIPS 2024].☆274Dec 3, 2024Updated last year
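- A lightweight enterprise-grade BFF framework that integrates xprofiler, so its powerful monitoring and alerting capabilities can be used directly.☆265Feb 7, 2024Updated 2 years ago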
- AI solution for Patent Classification☆143Jun 29, 2020Updated 5 years ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆86Jul 16, 2024Updated last year
- Deep Reinforcement Learning Algorithms for solving Atari 2600 Games☆143Mar 23, 2023Updated 3 years ago
- ☆248Apr 10, 2025Updated 11 months ago
- (Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators☆643Nov 10, 2025Updated 4 months ago
- ☆252Feb 11, 2025Updated last year
- A PyTorch implementation for Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks☆38Sep 9, 2020Updated 5 years ago