worv-ai / canvas
CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
☆12 Updated last week
Alternatives and similar repositories for canvas
Users who are interested in canvas are comparing it to the libraries listed below.
- Code for “Pretrained Language Models as Visual Planners for Human Assistance” ☆61 Updated 2 years ago
- ☆23 Updated 2 years ago
- [ICLR-2023] Rarity Score: A New Metric to Evaluate the Uncommonness of Synthesized Images ☆67 Updated 3 years ago
- Code for the paper "Multi-scale Diffusion Denoised Smoothing" (NeurIPS 2023) ☆14 Updated last year
- This is an official implementation of GRIT-VLP ☆21 Updated 3 years ago
- [ICLR 2023] RC-MAE ☆53 Updated last year
- ☆37 Updated 8 months ago
- A real-time, high-frequency, real-world desktop environment that is suitable for desktop-based ML development (agents, world models, etc.… ☆14 Updated 9 months ago
- [CVPR 2023] HierVL Learning Hierarchical Video-Language Embeddings ☆46 Updated 2 years ago
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation… ☆38 Updated 8 months ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT ☆56 Updated last year
- ☆127 Updated 3 years ago
- Course Website for "AI618: Generative Model and Unsupervised Learning" ☆37 Updated 2 years ago
- [ACL 2024 Findings] Official PyTorch Implementation code for realizing the technical part of CoLLaVO: Crayon Large Language and Vision mO… ☆98 Updated last year
- ☆46 Updated last year
- On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, … ☆18 Updated 10 months ago
- ☆38 Updated 2 years ago
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding ☆77 Updated 2 years ago
- https://arxiv.org/abs/2209.15162 ☆52 Updated 2 years ago
- VQVAE for video prediction ☆29 Updated 3 years ago
- SMILE: A Multimodal Dataset for Understanding Laughter ☆12 Updated 2 years ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22) ☆12 Updated 2 years ago
- Collection of PhD Advice Links ☆17 Updated 3 years ago
- [BMVC'21] Official PyTorch Implementation of "Grounded Situation Recognition with Transformers" ☆27 Updated 3 years ago
- Official implementation of the paper "FLAME: Free-form Language-based Motion Synthesis & Editing" ☆118 Updated last year
- read 1 paper everyday (only weekday) ☆56 Updated 4 years ago
- [ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation ☆45 Updated last month
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted) ☆17 Updated 6 months ago
- ElasticTok: Adaptive Tokenization for Image and Video ☆81 Updated 11 months ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023) ☆33 Updated 2 years ago