lorenmt / clarity-template
Clarity: A Minimalist Website Template for AI Research
☆114Updated 4 months ago
Alternatives and similar repositories for clarity-template
Users who are interested in clarity-template are comparing it to the libraries listed below.
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth?☆118Updated 3 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆67Updated 6 months ago
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student☆71Updated 2 months ago
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"☆102Updated last month
- This repo contains the code for the paper "Intuitive physics understanding emerges from self-supervised pretraining on natural videos"☆152Updated 2 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆113Updated 3 months ago
- Benchmarking physical understanding in generative video models☆160Updated 2 weeks ago
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆88Updated 11 months ago
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆132Updated last month
- ☆126Updated 4 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆50Updated 10 months ago
- Code for MetaMorph: Multimodal Understanding and Generation via Instruction Tuning☆160Updated 3 weeks ago
- This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or…☆124Updated 10 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆85Updated last year
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆76Updated 5 months ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆57Updated 2 months ago
- [ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)☆88Updated last week
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆172Updated 10 months ago
- ☆159Updated 4 months ago
- A Video Tokenizer Evaluation Dataset☆114Updated 4 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆70Updated 2 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆68Updated 6 months ago
- Personalized Representation from Personalized Generation (ICLR 2025)☆63Updated 2 months ago
- TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/TokenBridge☆108Updated last week
- ☆101Updated last month
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"☆65Updated last year
- Official PyTorch Implementation of "History-Guided Video Diffusion"☆306Updated 2 months ago
- [ICML 2025] Implementation of Spatial Reasoning with Denoising Models☆34Updated last week
- [ArXiv 2025] WORLDMEM: Long-term Consistent World Simulation with Memory☆97Updated this week
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆124Updated 8 months ago