[NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.
☆323 · Jul 9, 2024 · Updated last year
Alternatives and similar repositories for OmniTokenizer
Users who are interested in OmniTokenizer are comparing it to the libraries listed below.
- SEED-Voken: A Series of Powerful Visual Tokenizers · ☆1,003 · Nov 25, 2025 · Updated 5 months ago
- ☆21 · Jan 17, 2025 · Updated last year
- a family of versatile and state-of-the-art video tokenizers. · ☆445 · Sep 1, 2025 · Updated 8 months ago
- This repo contains the code for 1D tokenizer and generator · ☆1,145 · Mar 20, 2025 · Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation · ☆1,948 · Aug 15, 2024 · Updated last year
- Official PyTorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral). · ☆104 · Feb 11, 2025 · Updated last year
- [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization · ☆587 · Jun 7, 2024 · Updated last year
- High-performance Image Tokenizers for VAR and AR · ☆306 · Apr 25, 2025 · Updated last year
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models · ☆286 · Dec 4, 2024 · Updated last year
- This project features optimized Go language, expert source code, concurrent processing, and industry-best practices. · ☆142 · Mar 14, 2023 · Updated 3 years ago
- ☆141 · May 8, 2024 · Updated last year
- [ICLR 2025] Binary Spherical Quantization + [CVPR 2026] Leech Spherical Quantization · ☆208 · Dec 18, 2025 · Updated 4 months ago
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content · ☆604 · Oct 6, 2024 · Updated last year
- ☆286 · Jul 6, 2024 · Updated last year
- ☆134 · Sep 24, 2024 · Updated last year
- kight is a static analysis tool for C/C++ programs. · ☆213 · Dec 27, 2024 · Updated last year
- C++ code for FDTD Maxwell's equations. · ☆161 · Jun 11, 2023 · Updated 2 years ago
- Build a simple yet effective CNN to work as a sketch recognizer. Just like Google Quick-Draw Project. · ☆143 · Mar 23, 2023 · Updated 3 years ago
- Harnessing the Power of AI to Navigate the Information Age – Uncovering Truth, Promoting Transparency, and Championing Fact-Based Discour… · ☆147 · Jun 2, 2023 · Updated 2 years ago
- LinkedIn and Seek job information crawler · ☆106 · Apr 19, 2025 · Updated last year
- YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardwares (e.g., GPU) to efficiently support the exec… · ☆254 · Jan 7, 2026 · Updated 3 months ago
- Advanced Unsupervised Image Enhancement with GAN · ☆247 · Nov 11, 2024 · Updated last year
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation · ☆155 · Oct 18, 2024 · Updated last year
- ☆88 · Aug 26, 2025 · Updated 8 months ago
- Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024) · ☆257 · Mar 13, 2026 · Updated last month
- ☆246 · Nov 24, 2024 · Updated last year
- A Workspace for HMI tools · ☆163 · Jul 11, 2024 · Updated last year
- ☆143 · May 25, 2024 · Updated last year
- ☆71 · Sep 2, 2023 · Updated 2 years ago
- Visualization, simulation, manipulation of intrinsically disordered proteins with Gibbs sampling · ☆288 · Oct 24, 2024 · Updated last year
- Evaluation of Text-to-Video Generation Models: A Dynamics Perspective [NeurIPS 2024]. · ☆274 · Dec 3, 2024 · Updated last year
- A lightweight enterprise-grade BFF framework that integrates xprofiler, so its powerful monitoring and alerting capabilities can be used directly. · ☆264 · Feb 7, 2024 · Updated 2 years ago
- AI solution for Patent Classification · ☆142 · Jun 29, 2020 · Updated 5 years ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis · ☆86 · Jul 16, 2024 · Updated last year
- Deep Reinforcement Learning Algorithms for solving Atari 2600 Games · ☆143 · Mar 23, 2023 · Updated 3 years ago
- ☆248 · Apr 10, 2025 · Updated last year
- ☆251 · Feb 11, 2025 · Updated last year
- (Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators · ☆643 · Nov 10, 2025 · Updated 5 months ago
- A PyTorch implementation for Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks · ☆38 · Sep 9, 2020 · Updated 5 years ago