a1600012888 / LaCTLinks
Code release for paper "Test-Time Training Done Right"
☆283Updated 2 weeks ago
Alternatives and similar repositories for LaCT
Users that are interested in LaCT are comparing it to the libraries listed below
Sorting:
- ☆142Updated 8 months ago
- [ArXiv 2025] WorldMem: Long-term Consistent World Simulation with Memory☆229Updated last week
- ☆157Updated last month
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student☆82Updated 6 months ago
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆488Updated 2 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆136Updated last month
- A Video Tokenizer Evaluation Dataset☆133Updated 8 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆233Updated 2 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆77Updated 10 months ago
- Benchmarking physical understanding in generative video models☆196Updated 4 months ago
- A list of works on video generation towards world model☆165Updated last month
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆40Updated 4 months ago
- ☆113Updated last month
- Generative World Explorer☆155Updated 3 months ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆48Updated 2 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆51Updated last year
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆92Updated last year
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆173Updated 2 weeks ago
- Official PyTorch implementation of FlowMo.☆95Updated 5 months ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆128Updated 7 months ago
- ☆89Updated last month
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning☆209Updated 5 months ago
- Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"☆171Updated 3 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆151Updated 7 months ago
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆43Updated 3 months ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆189Updated 4 months ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆45Updated 3 weeks ago
- Implementation of "Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision"☆165Updated last year
- (CVPR 2025 Highlight) The Scene Language: Representing Scenes with Programs, Words, and Embeddings☆237Updated 2 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆116Updated last month