☆34May 14, 2025Updated last year
Alternatives and similar repositories for vitok
Users that are interested in vitok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations☆202Sep 18, 2025Updated 9 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Jun 26, 2024Updated 2 years ago
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆181Mar 18, 2026Updated 3 months ago
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models☆34Nov 2, 2025Updated 7 months ago
- ☆24Jun 18, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆43Jul 26, 2025Updated 11 months ago
- Official implementation of SimFlow☆32Dec 16, 2025Updated 6 months ago
- (CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning☆24Mar 11, 2025Updated last year
- [NeurIPS 2022] code for "Visual Concepts Tokenization"☆23Oct 10, 2022Updated 3 years ago
- ☆43Jun 6, 2025Updated last year
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆194Feb 24, 2026Updated 4 months ago
- Pytorch implementation of Twelve Labs' Video Foundation Model evaluation framework & open embeddings☆36Aug 23, 2024Updated last year
- ACCO: An optimization algorithm for sharded distributed LLM training.☆13May 22, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆167Jan 31, 2025Updated last year
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning☆235Jan 22, 2026Updated 5 months ago
- ☆20Nov 23, 2022Updated 3 years ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆18Feb 9, 2026Updated 4 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆32Jun 5, 2025Updated last year
- Code release for "Generative Modeling of Weights: Generalization or Memorization?"☆22Apr 9, 2026Updated 2 months ago
- ☆21Mar 25, 2025Updated last year
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆527Nov 14, 2025Updated 7 months ago
- Official code of "Optimizing 4D Gaussians for Dynamic Scene Video from Single Landscape Images (ICLR 2025)"☆27Mar 4, 2026Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.☆33Aug 13, 2025Updated 10 months ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆30Aug 19, 2025Updated 10 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 3 years ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆106Feb 11, 2025Updated last year
- new optimizer☆20Aug 4, 2024Updated last year
- ☆32Jul 29, 2024Updated last year
- ☆21Apr 27, 2026Updated 2 months ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"☆11Oct 11, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆53Jan 18, 2024Updated 2 years ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆133Dec 3, 2024Updated last year
- ☆17Mar 2, 2023Updated 3 years ago
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261☆13Aug 22, 2021Updated 4 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆144May 6, 2026Updated last month
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆92Oct 30, 2024Updated last year
- Official Implemenation for RAEv2: Improved Baselines with Representation Autoencoders☆290May 21, 2026Updated last month