☆118Nov 8, 2025Updated 3 months ago
Alternatives and similar repositories for ml-atoken
Users that are interested in ml-atoken are comparing it to the libraries listed below
Sorting:
- ☆25Aug 12, 2025Updated 6 months ago
- This repository is an official implementation of the paper A Simple Baseline for Open-World Tracking via Self-training.☆10Jan 26, 2024Updated 2 years ago
- VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.☆35Feb 3, 2026Updated last month
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆512Nov 14, 2025Updated 3 months ago
- Flux training codes (lora) for UniTEX☆23Jun 8, 2025Updated 8 months ago
- [ICCV 2025] Repo for Objaverse++, Curated 3D Object Dataset with Quality Annotations☆104Dec 4, 2025Updated 3 months ago
- Official repo for: Epipolar Geometry Improves Video Generation Models☆79Oct 28, 2025Updated 4 months ago
- Official Implementation for (ICLR 2025) Rethinking Diffusion Posterior Sampling: From Conditional Score Estimator to Maximizing a Posteri…☆21Jan 31, 2025Updated last year
- ☆44Nov 26, 2025Updated 3 months ago
- [CVPR 2025 Oral] PyTorch re-implementation for Autoregressive Distillation of Diffusion Transformers (ARD).☆139Oct 1, 2025Updated 5 months ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆20Jan 11, 2026Updated last month
- ☆20Oct 17, 2024Updated last year
- SIM4D☆30Mar 27, 2025Updated 11 months ago
- Official repo for UAE☆169Dec 29, 2025Updated 2 months ago
- ☆111Oct 3, 2025Updated 5 months ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Jun 13, 2023Updated 2 years ago
- Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning☆20Dec 21, 2023Updated 2 years ago
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)☆25May 16, 2024Updated last year
- Self-reimplemented version of 4D-LRM.☆65May 30, 2025Updated 9 months ago
- ☆201Oct 22, 2025Updated 4 months ago
- Repository for TetWeave: Isosurface Extraction using On-The-Fly Delaunay Tetrahedral Grids for Gradient-Based Mesh Optimization (SIGGRAPH…☆121May 8, 2025Updated 9 months ago
- Code for Faster VGGT with Block-Sparse Global Attention☆89Nov 14, 2025Updated 3 months ago
- This repository provides the official implementation of VTBench, a benchmark designed to evaluate the performance of visual tokenizers (V…☆35Jul 30, 2025Updated 7 months ago
- [BMVC 2025] Official Implementation of the paper "PerSense: Personalized Instance Segmentation in Dense Images"☆28Dec 18, 2025Updated 2 months ago
- ☆24Mar 17, 2024Updated last year
- An official PyTorch implementation for CLIPPR☆30Jul 22, 2023Updated 2 years ago
- ☆30Sep 20, 2024Updated last year
- ☆29Oct 24, 2023Updated 2 years ago
- ☆121Jul 19, 2025Updated 7 months ago
- [ECCV 2024] Soft Prompt Generation for Domain Generalization☆31Oct 1, 2024Updated last year
- [ICLR 2025] EdgeRunner: Auto-regressive Auto-encoder for Efficient Mesh Generation☆299Dec 22, 2024Updated last year
- ☆18Jun 10, 2025Updated 8 months ago
- Transcode audio files to MP3 with parallel instances of FFMPEG☆30Jul 17, 2020Updated 5 years ago
- ElasticTok: Adaptive Tokenization for Image and Video☆88Nov 4, 2024Updated last year
- [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models☆1,402Dec 16, 2025Updated 2 months ago
- Train VAE like a boss☆313Oct 21, 2024Updated last year
- [CVPR 2024] MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation☆131Apr 29, 2024Updated last year
- [ICCV 2023] Official implementation of the paper "Less is More: Focus Attention for Efficient DETR"☆82Jul 30, 2023Updated 2 years ago
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?☆42Nov 1, 2024Updated last year