A collection of VLMs papers, blogs, and projects, with a focus on VLMs in Autonomous Driving and related reasoning techniques.
☆11Nov 16, 2024Updated last year
Alternatives and similar repositories for Awesome-VLMs-Strawberry
Users that are interested in Awesome-VLMs-Strawberry are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2026] "GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation"☆72Dec 17, 2025Updated 4 months ago
- End2EndPerception deployment solution based on vision sparse transformer paradigm is open sourced.☆176Jan 12, 2025Updated last year
- Hierarchical Models for Learning Features from Images and Videos☆12Feb 7, 2023Updated 3 years ago
- Mixed Integer Quadratic Programming for Python (using MINLP-solver Bonmin)☆14Mar 12, 2018Updated 8 years ago
- [NeurIPS 2025] BOOM, A Planning-driven Model-Based RL algorithm☆58Updated this week
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ AAAI 2026 ] The official implementation of 'MonoCLUE: Object-Aware Clustering Enhances Monocular 3D Object Detection'☆18Mar 23, 2026Updated 3 weeks ago
- ☆16Dec 7, 2024Updated last year
- Repo for 'VLM-Auto: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes'☆27Oct 10, 2024Updated last year
- [IROS 2025] ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model☆42Jun 26, 2025Updated 9 months ago
- (T-ASE 2024) LeTO: Learning Constrained Visuomotor Policy with Differentiable Trajectory Optimization☆14Oct 14, 2024Updated last year
- ☆14May 9, 2023Updated 2 years ago
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning☆45Mar 18, 2026Updated last month
- A demo to show how to convert a TensorFlow model to TensorRT uff or PLAN☆11Jul 22, 2018Updated 7 years ago
- [CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models"☆17Sep 12, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [HKUST Template] A latex template for PhD Qualification Exam (AKA PQE), especially for ECE from 2022 and later.☆14Jan 2, 2023Updated 3 years ago
- 软考--系统架构设计师(软考高级)通过复习资料,包含2009-2018历年综合知识、案例分析真题与详细答案以及教材和整套教学视频。2018年相关资料作者近期正在持续更新。☆12Jan 26, 2024Updated 2 years ago
- ☆16Apr 29, 2022Updated 3 years ago
- Code repository for paper "Neural network multi-component gas mixture analysis with broadband dual-frequency comb absorption spectroscopy…☆13Jun 27, 2022Updated 3 years ago
- The code for Spectral Super-Resolution via Deep Low-Rank Tensor Representation☆11Mar 21, 2024Updated 2 years ago
- An implementation of EMMA (End-to-End Multimodal Model for Autonomous Driving) using the Claude API, based on the EMMA paper.☆12Dec 14, 2024Updated last year
- BEV & Occupancy 从入门到精通☆17Jul 4, 2024Updated last year
- 本项目综合运用d3、echarts来完成可视化工作,实现了对nba两场比赛的可视化数据分析,包括球员运动轨迹、个人数据、传球次数以及得分位置等多种可交互式图表。通过可视化方法,我们能够进一步深入分析球队的具体情况,便于制定更佳的战术。☆15Dec 19, 2022Updated 3 years ago
- ☆12May 31, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A NeSy Framework for Learning with Requirements☆21Feb 4, 2026Updated 2 months ago
- FastPoseCNN: Real-time 6D Pose and Size Estimation☆14Jul 6, 2021Updated 4 years ago
- Exploring the Spectral Prior for Hyperspectral Image Super-Resolution (IEEE Transactions on Image Processing 24)☆18Oct 8, 2024Updated last year
- ☆36Jun 3, 2025Updated 10 months ago
- Stores here are the source codes for the official implementation of "Generating Traffic Scenarios via In-Context Learning to Learn Better…☆23May 1, 2025Updated 11 months ago
- This is the reserch code of the IEEE Transactions on Geoscience and Remote Sensing 2022 paper "Spectral Super-Resolution of Multispectral…☆12Nov 14, 2022Updated 3 years ago
- [TCSVT] DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction☆107Mar 26, 2026Updated 3 weeks ago
- [ICCV 2025] ETA: Efficiency through Thinking Ahead, A Dual Approach to Self-Driving with Large Models☆42Jul 2, 2025Updated 9 months ago
- This repository implements various Search Based (Heuristic and Incremental) and Sampling Based (Multi Query and Single Query) motion plan…☆16May 28, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- measure objects in augmented reality, using just your smartphone.☆14Aug 22, 2022Updated 3 years ago
- Implementation of MINIVAN (Mixed INteger InteractiVe plAnNing)☆20Jan 30, 2022Updated 4 years ago
- [IEEE TIV] OccFusion: Multi-Sensor Fusion Framework for 3D Semantic Occupancy Prediction☆58Nov 5, 2024Updated last year
- A Spectral Diffusion Prior for Unsupervised Hyperspectral Image Super-Resolution, IEEE TGRS, 2024☆19Feb 24, 2025Updated last year
- 博客☆12Nov 8, 2025Updated 5 months ago
- PyTorch codes for reproducing TIP 2019 paper "HyperReconNet: Joint Coded Aperture Optimization and Image Reconstruction for Compressive H…☆10Apr 13, 2022Updated 4 years ago
- A framework to learn Compressive Learning system with multidimensional data☆12Jul 26, 2021Updated 4 years ago