☆671Apr 12, 2025Updated 10 months ago
Alternatives and similar repositories for platonic-rep
Users that are interested in platonic-rep are comparing it to the libraries listed below
Sorting:
- [COLM'25] Official implementation of the Law of Vision Representation in MLLMs☆176Oct 6, 2025Updated 5 months ago
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,560Mar 16, 2025Updated 11 months ago
- Learning from synthetic data - code and models☆328Jan 6, 2024Updated 2 years ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆55May 27, 2024Updated last year
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆145Feb 11, 2025Updated last year
- Cambrian-1 is a family of multimodal LLMs with a vision-centric design.☆1,986Nov 7, 2025Updated 4 months ago
- [ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)☆713Feb 29, 2024Updated 2 years ago
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,887Jan 8, 2026Updated last month
- code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆1,169Nov 9, 2025Updated 3 months ago
- EVE Series: Encoder-Free Vision-Language Models from BAAI☆368Jul 24, 2025Updated 7 months ago
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆2,084Jul 29, 2024Updated last year
- Consistency Distilled Diff VAE☆2,211Nov 7, 2023Updated 2 years ago
- 🚀 Efficient implementations of state-of-the-art linear attention models☆4,474Updated this week
- A suite of image and video neural tokenizers☆1,711Feb 11, 2025Updated last year
- PyTorch implementation of RCG https://arxiv.org/abs/2312.03701☆938Sep 27, 2024Updated last year
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838☆1,872Feb 20, 2026Updated 2 weeks ago
- Training Large Language Model to Reason in a Continuous Latent Space☆1,522Aug 12, 2025Updated 6 months ago
- A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes…☆4,167Jan 5, 2026Updated 2 months ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆512Nov 14, 2025Updated 3 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆199May 28, 2024Updated last year
- Stanford NLP Python library for Representation Finetuning (ReFT)☆1,560Jan 14, 2026Updated last month
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.☆1,403Aug 4, 2025Updated 7 months ago
- Code for BLT research paper☆2,029Nov 3, 2025Updated 4 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video.☆3,566Feb 27, 2025Updated last year
- E5-V: Universal Embeddings with Multimodal Large Language Models☆274Dec 10, 2025Updated 2 months ago
- Simple RL training for reasoning☆3,830Dec 23, 2025Updated 2 months ago
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆2,181Feb 11, 2026Updated 3 weeks ago
- [ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models☆963Jul 10, 2025Updated 7 months ago
- EDM2 and Autoguidance -- Official PyTorch implementation☆824Dec 9, 2024Updated last year
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection☆1,678Oct 28, 2024Updated last year
- [ECCV 2024] Characterizing Robustness via Natural Input Gradients☆13Oct 14, 2024Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆14Apr 30, 2025Updated 10 months ago
- ☆4,577Sep 14, 2025Updated 5 months ago
- 4M: Massively Multimodal Masked Modeling☆1,787Jun 2, 2025Updated 9 months ago
- This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or…☆163Sep 27, 2025Updated 5 months ago
- Emu Series: Generative Multimodal Models from BAAI☆1,768Jan 12, 2026Updated last month
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆30Apr 27, 2024Updated last year
- Representation Engineering: A Top-Down Approach to AI Transparency☆953Aug 14, 2024Updated last year
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆630Jul 1, 2025Updated 8 months ago