[ICLR2026] WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction
☆65Sep 3, 2025Updated 6 months ago
Alternatives and similar repositories for WeTok
Users that are interested in WeTok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR2026] Video-GPT via Next Clip Diffusion.☆44Jun 2, 2025Updated 9 months ago
- Explore how to get a VQ-VAE models efficiently!☆68Jul 24, 2025Updated 8 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆202Jan 7, 2026Updated 2 months ago
- Official repository for the UAE paper, unified-GRPO, and unified-Bench☆160Sep 12, 2025Updated 6 months ago
- Code release for Ming-UniVision: Joint Image Understanding and Geneation with a Continuous Unified Tokenizer☆142Oct 14, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆95Mar 1, 2025Updated last year
- ☆26Jan 20, 2025Updated last year
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆147Feb 11, 2025Updated last year
- Official code for CVPR 2026 paper: VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection☆89Mar 3, 2026Updated 3 weeks ago
- [CVPR 2025] Pytorch implementation of the paper "Hearing Anywhere in Any Environment"☆29Sep 18, 2025Updated 6 months ago
- [Neurips 2025 NextVid Workshop Oral✨] Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minim…☆60Sep 22, 2025Updated 6 months ago
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback☆248Jan 24, 2026Updated 2 months ago
- [IJCV 2025] The project is an official implementation of our paper "Learning Structure-Supporting Dependencies via Keypoint Interactive T…☆18Jul 16, 2025Updated 8 months ago
- A Spitting Image: Modular Superpixel Tokenization in Vision Transformers☆21Sep 12, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- code for HiPer☆32Mar 21, 2023Updated 3 years ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆518Nov 14, 2025Updated 4 months ago
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization☆55Sep 16, 2025Updated 6 months ago
- [MIPR 2022 & TMM 2023] "Attentive Graph Neural Networks for Few-shot Learning" with its extension version☆16Apr 17, 2023Updated 2 years ago
- The offical repository of "So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection"☆29Oct 29, 2025Updated 4 months ago
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis☆86Feb 3, 2025Updated last year
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆55Dec 25, 2025Updated 3 months ago
- ☆24May 23, 2025Updated 10 months ago
- ☆12Apr 19, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- the official repo for "D-AR: Diffusion via Autoregressive Models"☆135Jan 29, 2026Updated last month
- ☆22Aug 11, 2020Updated 5 years ago
- [CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space☆24Mar 15, 2026Updated last week
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆14Nov 21, 2025Updated 4 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆80Dec 10, 2024Updated last year
- [CVPR 2026] Real2Edit2Real: Generating Robotic Demonstrations via a 3D Control Interface☆37Mar 10, 2026Updated 2 weeks ago
- A small project that uses Discrete Denoising Diffusion Probabilistic Models (D3PMs), a generative model for discrete data that builds upo…☆14Aug 10, 2024Updated last year
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆100Feb 11, 2025Updated last year
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models.☆14May 26, 2025Updated 10 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆166Jan 31, 2025Updated last year
- The official code of OneActor: Consistent Subject Generation via Cluster-Conditioned Guidance (NeurIPS 2024)☆17Dec 23, 2024Updated last year
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆14Feb 5, 2024Updated 2 years ago
- 大学Latex答辩模版,当前包含川大、哈工大、中科大。☆10Jul 22, 2024Updated last year
- DL Backtrace is a new explainablity technique for deep learning models that works for any modality and model type.☆25Updated this week
- This is the repo for "Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition", CVPR2025.☆20Dec 22, 2025Updated 3 months ago
- Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)☆12Mar 6, 2025Updated last year