YuZhaoshu / Efficient-VLAs-SurveyView external linksLinks
🔥This is a curated list of "A survey on Efficient Vision-Language Action Models" research. We will continue to maintain and update the repository, so follow us to keep up with the latest developments!!!
☆126Jan 5, 2026Updated last month
Alternatives and similar repositories for Efficient-VLAs-Survey
Users that are interested in Efficient-VLAs-Survey are comparing it to the libraries listed below
Sorting:
- An innovative method designed to augment the capabilities of existing video diffusion models☆22May 10, 2024Updated last year
- ☆23May 29, 2023Updated 2 years ago
- Control 3f robotiq gripper using python and modbus client☆13Jun 27, 2024Updated last year
- This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of…☆10Oct 4, 2021Updated 4 years ago
- Code for our EMNLP 2022 paper: Generative Entity Typing with Curriculum Learning.☆13Aug 19, 2023Updated 2 years ago
- Welcome to CV-PCL Viewer! This software has simple image and video processing functions, as well as the ability to visualize point cloud …☆16Jul 20, 2024Updated last year
- Federated Meta-Learning for Emotion and Sentiment Aware Multi-modal Complaint Identification☆10May 30, 2024Updated last year
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- In this assignment, you will design a controller for a tele-manipulation task in eye surgery. For the design of your controller you can …☆10Sep 7, 2019Updated 6 years ago
- Dataflow-MM, multi-media operators for Dataflow. We aim to prepare data for Multimodal Large Language Models.☆28Updated this week
- 🚀 Sliding Window Attention Training for Efficient Large Language Models☆15Dec 8, 2025Updated 2 months ago
- Franka Emikia Research 3 robotic arm teleoperation by Force Dimension Sigma.7☆12Jul 27, 2024Updated last year
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving☆52Feb 14, 2025Updated last year
- ICCV 2021 papers and code focus on adversarial attacks and defense☆11Nov 5, 2021Updated 4 years ago
- Project is intended to build and deploy an scene detection application onto Qualcomm Robotics development Kit (RB5) that detects whether …☆10Jun 26, 2022Updated 3 years ago
- Code and resources for EMNLP 2022 paper on 'Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions'☆10Mar 11, 2024Updated last year
- Improving word mover’s distance by leveraging self-attention matrix (Published in EMNLP 2023 Findings)☆10Jun 17, 2025Updated 8 months ago
- A simple independant Python driver to use the Robotiq 2F-85 Gripper☆16May 23, 2024Updated last year
- ☆10Nov 23, 2023Updated 2 years ago
- Submission Under Review☆17May 15, 2025Updated 9 months ago
- See through the Dark: Learning Illumination-affined Representations for Nighttime Occupancy Prediction (NeurIPS 2025)☆25Oct 21, 2025Updated 3 months ago
- Simple physic simulation☆16Dec 23, 2025Updated last month
- ☆11Jul 11, 2023Updated 2 years ago
- ☆14Apr 25, 2025Updated 9 months ago
- A ROS2 package for interfacing with Force Dimension haptics robots.☆14Jun 3, 2024Updated last year
- DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transfer☆14Dec 16, 2022Updated 3 years ago
- ☆13Mar 28, 2025Updated 10 months ago
- [ECAI 2024] TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement☆13Oct 16, 2024Updated last year
- [ICCVw19] DAME WEB: DynAmic MEan with Whitening Ensemble Binarization for Landmark Retrieval without Human Annotation☆10Dec 20, 2019Updated 6 years ago
- Deep Feature Flow for Video Recognition☆10Jun 9, 2017Updated 8 years ago
- Pytorch implementation of Yolo V3☆11Aug 30, 2018Updated 7 years ago
- Summaries of ICML 2024 papers☆12Jul 31, 2024Updated last year
- VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning☆126Oct 6, 2025Updated 4 months ago
- Code and data for Distributional Correlation–Aware Knowledge Distillation for Stock Trading Volume Prediction (ECML-PKDD 22)☆15Sep 6, 2022Updated 3 years ago
- A yolov7-tiny model inference applied on qualcomm snpe for pedestrian detection with embedded system.☆13Sep 23, 2024Updated last year
- ☆12Sep 29, 2019Updated 6 years ago
- ☆10Jan 8, 2020Updated 6 years ago
- This repo contains most of outstanding papers on visual saliency (2013-2017).☆10Dec 6, 2017Updated 8 years ago
- code for promptCSE, emnlp 2022☆11Apr 10, 2023Updated 2 years ago