lorenmt / clarity-template
Clarity: A Minimalist Website Template for AI Research
☆87Updated this week
Alternatives and similar repositories for clarity-template:
Users that are interested in clarity-template are comparing it to the libraries listed below
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆94Updated 2 months ago
- ☆99Updated last week
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆90Updated 2 months ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆61Updated 3 weeks ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆81Updated last year
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆46Updated 6 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆44Updated 2 months ago
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆73Updated 8 months ago
- This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolu…☆124Updated 2 weeks ago
- A Video Tokenizer Evaluation Dataset☆86Updated this week
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆118Updated 4 months ago
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆165Updated 6 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆289Updated 6 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated 3 months ago
- This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or…☆112Updated 6 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆62Updated 3 months ago
- This repository is a collection of research papers on World Models.☆37Updated last year
- (CVPR 2023) Seeing a Rose in Five Thousand Ways☆116Updated last year
- [ICCV 2023] Code for "Multi-task View Synthesis with Neural Radiance Fields"☆11Updated last year
- ☆128Updated last month
- ☆36Updated this week
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆35Updated 2 months ago
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"☆61Updated 10 months ago
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆26Updated 2 months ago
- Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"☆197Updated last year
- Personalized Representation from Personalized Generation☆48Updated 3 weeks ago
- [CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"☆76Updated 11 months ago
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆37Updated 3 weeks ago
- ☆43Updated 4 months ago
- [ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers☆121Updated 7 months ago