human-centeredAI / awesomeHAI
a reading list for human-centered AI
☆42Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for awesomeHAI
- ☆72Updated 2 years ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated last month
- This repository is a collection of research papers on World Models.☆35Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆64Updated 2 years ago
- Official repository for the General Robust Image Task (GRIT) Benchmark☆50Updated last year
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding☆75Updated last year
- ☆29Updated 4 months ago
- A Video Tokenizer Evaluation Dataset☆38Updated this week
- ☆9Updated last year
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆23Updated 9 months ago
- ☆57Updated last year
- Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".☆78Updated last year
- A list of papers and other resources on language-guided image editing.☆37Updated 3 years ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆71Updated this week
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Updated last year
- ElasticTok: Adaptive Tokenization for Image and Video☆31Updated last week
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆78Updated last year
- CCVS: Context-aware Controllable Video Synthesis☆22Updated 2 years ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆78Updated 9 months ago
- Paper List for In-context Learning 🌷☆20Updated last year
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"☆114Updated 3 months ago
- Official repository of S-Agents: Self-organizing Agents in Open-ended Environment☆17Updated 7 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆55Updated last month
- 🔥 Aurora Series: A more efficient multimodal large language model series for video.☆41Updated 2 weeks ago
- [NeurIPS 2021] Unsupervised Foreground Extraction via Deep Region Competition☆42Updated 3 years ago
- ☆10Updated last week
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆106Updated 11 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆30Updated 6 months ago