☆34May 14, 2025Updated 9 months ago
Alternatives and similar repositories for vitok
Users that are interested in vitok are comparing it to the libraries listed below
Sorting:
- ☆23Jun 18, 2024Updated last year
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- Experimental GPU language with meta-programming☆26Sep 6, 2024Updated last year
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations☆201Sep 18, 2025Updated 5 months ago
- ☆12Jan 4, 2024Updated 2 years ago
- Official implementation of SimFlow☆26Dec 16, 2025Updated 2 months ago
- ☆13Nov 1, 2023Updated 2 years ago
- An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers☆57Jul 11, 2023Updated 2 years ago
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆174Jun 26, 2025Updated 8 months ago
- A basic pure pytorch implementation of flash attention☆16Oct 28, 2024Updated last year
- This repo contains the code for the paper "Object-cropping for SSL".☆18Feb 14, 2023Updated 3 years ago
- Official code of "Optimizing 4D Gaussians for Dynamic Scene Video from Single Landscape Images (ICLR 2025)"☆27May 29, 2025Updated 9 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆21Jan 8, 2025Updated last year
- Repo for the paper: PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees (CVPR 2024)☆23Aug 14, 2024Updated last year
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models☆32Nov 2, 2025Updated 3 months ago
- Scaling Sparse Fine-Tuning to Large Language Models☆18Jan 31, 2024Updated 2 years ago
- ☆23Jan 27, 2025Updated last year
- (CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning☆24Mar 11, 2025Updated 11 months ago
- ☆20Mar 25, 2025Updated 11 months ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆131Feb 21, 2026Updated last week
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- [NeurIPS 2022] code for "Visual Concepts Tokenization"☆23Oct 10, 2022Updated 3 years ago
- An implementation of the Llama architecture, to instruct and delight☆21May 31, 2025Updated 9 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆42Jul 26, 2025Updated 7 months ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆32Dec 21, 2024Updated last year
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning☆234Jan 22, 2026Updated last month
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆166Jan 31, 2025Updated last year
- new optimizer☆20Aug 4, 2024Updated last year
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆98Feb 11, 2025Updated last year
- ☆21Mar 3, 2025Updated 11 months ago
- ☆102Jul 13, 2025Updated 7 months ago
- ☆28Dec 21, 2023Updated 2 years ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- Memory Efficient Training Framework for Large Video Generation Model☆25Apr 22, 2024Updated last year
- ☆54Jul 16, 2025Updated 7 months ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆512Nov 14, 2025Updated 3 months ago
- ☆27May 3, 2024Updated last year