Charlotte-CharMLab / Fibottention
Inceptive Visual Representation Learning with Diverse Attention Across Heads. Image Classification, Action Recognition, and Robot Learning.
☆16Updated 5 months ago
Alternatives and similar repositories for Fibottention:
Users that are interested in Fibottention are comparing it to the libraries listed below
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024☆27Updated 2 months ago
- Given an RGBD image and a text prompt, ForceSight produces visual-force goals for a robot, enabling mobile manipulation in unseen environ…☆20Updated last year
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆60Updated last year
- Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers☆21Updated 8 months ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆13Updated 2 years ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆26Updated 11 months ago
- Language/Clicking grounded SAM + VOS for real-time video object tracking☆18Updated 2 months ago
- ☆14Updated last month
- Implementation of Language-Conditioned Path Planning (Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James)☆22Updated last year
- ☆42Updated this week
- This repository provides the sample code designed to interpret human demonstration videos and convert them into high-level tasks for robo…☆36Updated 4 months ago
- Detic + SAM for open-vocabulary object detection and segmentation.☆18Updated 10 months ago
- Task-Focused Few-Shot Object Detection Benchmark☆13Updated 2 weeks ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆44Updated 11 months ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆52Updated 6 months ago
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆13Updated 2 months ago
- InterPreT: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning (RSS 2024)☆30Updated 9 months ago
- ☆43Updated last year
- (CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning☆10Updated 3 weeks ago
- This is the offical repository of LLAVIDAL☆12Updated last week
- ☆59Updated 2 weeks ago
- [CoRL 2023] XSkill: cross embodiment skill discovery☆59Updated last year
- ☆11Updated 2 years ago
- ☆15Updated 4 months ago
- ☆70Updated last month
- ☆30Updated this week
- ☆10Updated 9 months ago
- ☆31Updated last year
- Code release for SceneReplica paper.☆23Updated last month
- [ICRA'24] Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning☆64Updated 7 months ago