Boosting Driving Scene Understanding with Advanced Vision-Language Models
☆33May 19, 2023Updated 2 years ago
Alternatives and similar repositories for DSify
Users that are interested in DSify are comparing it to the libraries listed below
Sorting:
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated 10 months ago
- Exploring Large Language Models for Trajectory Prediction: A Technical Perspective☆27Jun 12, 2024Updated last year
- Official Pytorch Implementation of "Outlier-weighed Layerwise Sampling for LLM Fine-tuning" by Pengxiang Li, Lu Yin, Xiaowei Gao, Shiwei …☆35Jun 3, 2025Updated 9 months ago
- Original materials for the "Introduction to Solving Biological Problems with Python" 2 days course☆10Jun 25, 2018Updated 7 years ago
- ☆10Jul 30, 2024Updated last year
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 7 months ago
- Deep Learning Project for Trajectory Prediction using nuScenes dataset.☆10Sep 13, 2022Updated 3 years ago
- ☆11Sep 2, 2024Updated last year
- Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding …☆42Aug 5, 2022Updated 3 years ago
- ☆11Feb 9, 2026Updated 3 weeks ago
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo…☆17Jun 12, 2024Updated last year
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆25Feb 10, 2026Updated 3 weeks ago
- Data for evaluating GPT-4V☆11Oct 26, 2023Updated 2 years ago
- We introduce Chart2Code, the first user-driven, hierarchical benchmark that systematically evaluates Large Multimodal Models on chart-to-…☆24Jan 27, 2026Updated last month
- A PyTorch implementation of the paper "MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis".☆12Jan 16, 2023Updated 3 years ago
- Neural Network Image Compression☆13Jan 12, 2018Updated 8 years ago
- Code for Heima☆59Apr 21, 2025Updated 10 months ago
- NN1 network from FaceNet: A Unified Embedding for Face Recognition and Clustering, in Keras.☆11Jun 13, 2017Updated 8 years ago
- 第九届中国软件杯视频全量分析“一等奖”&第十届中国软件杯A2百度paddlepaddle跟踪赛道“二等奖”☆10Jul 10, 2023Updated 2 years ago
- Edit and Generate Anything in 3D world!☆14Apr 15, 2023Updated 2 years ago
- ☆10Mar 30, 2022Updated 3 years ago
- Repository of GUI Action Narrator☆13Apr 8, 2025Updated 10 months ago
- ☆24Jan 22, 2026Updated last month
- This project explores using machine learning methods for detection of Parkinson's disease using an individual's speech.☆15Nov 18, 2019Updated 6 years ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆18Mar 7, 2025Updated last year
- [EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding☆3,128Jun 4, 2024Updated last year
- Driver Attention Prediction in Accidental Scenarios☆122Dec 11, 2024Updated last year
- Compress conventional Vision-Language Pre-training data☆53Sep 22, 2023Updated 2 years ago
- Official PyTorch implementation of our paper "Adversarial Training of Self-supervised Monocular Depth Estimation against Physical-World A…☆11Feb 8, 2023Updated 3 years ago
- pre-trained vision and language model summary☆12Apr 20, 2021Updated 4 years ago
- This project enables to visualize NuScene data such as Point Cloud data for Radar, Lidar and Images captures using various sensors☆11Feb 4, 2021Updated 5 years ago
- ☆15Jan 8, 2020Updated 6 years ago
- autoredteam: code for training models that automatically red team other language models☆15Aug 9, 2023Updated 2 years ago
- Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue☆14Oct 12, 2021Updated 4 years ago
- This is the project page for the HOSNeRF☆16Dec 11, 2023Updated 2 years ago
- ☆10Mar 26, 2024Updated last year
- ☆15Mar 12, 2024Updated last year
- ☆13Aug 27, 2021Updated 4 years ago
- A structurally comprehensive dataset of AMR-to-text alignments for coverage of a larger variety of linguistic phenomena, for research rel…☆15Dec 10, 2022Updated 3 years ago