alibaba / AICITY2024_Track2_AliOpenTrek_CityLLaVA
☆58 Jul 1, 2024 Updated last year
Alternatives and similar repositories for AICITY2024_Track2_AliOpenTrek_CityLLaVA
Users who are interested in AICITY2024_Track2_AliOpenTrek_CityLLaVA are comparing it to the libraries listed below
- AICITY2024 Track 2 - Code from AIO_ISC Team ☆37 Jul 13, 2024 Updated last year
- ☆52 Jun 16, 2025 Updated 8 months ago
- ☆16 Mar 26, 2025 Updated 10 months ago
- ☆53 May 6, 2024 Updated last year
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity ☆22 Aug 28, 2025 Updated 5 months ago
- ☆17 Nov 28, 2025 Updated 2 months ago
- ☆13 Oct 15, 2024 Updated last year
- ☆42 Sep 15, 2025 Updated 5 months ago
- TransSimHub is a lightweight Python library for simulating and controlling transportation systems. ☆49 Updated this week
- Official pytorch implementation of the ICML2024 main conference paper: Pedestrian Attribute Recognition as Label-balanced Multi-label Lea… ☆13 Jul 22, 2024 Updated last year
- Pytorch implementation of the paper 'Towards Scenario Generalization for Vision-based Roadside 3D Object Detection' ☆16 Mar 9, 2025 Updated 11 months ago
- Official Repo for ICCV25-Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization ☆27 Feb 4, 2026 Updated last week
- ☆16 Aug 29, 2023 Updated 2 years ago
- [CVPRW 2024] TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning. Official code for the 3rd place solution of t… ☆51 Feb 11, 2025 Updated last year
- An Efficient and Realistic Traffic Simulator with Embedded Machine Learning Models ☆22 Dec 19, 2024 Updated last year
- ☆50 Jun 30, 2024 Updated last year
- This is a project about Optical Character Recognition (OCR) in Vietnamese texts by using PaddleOCR and VietOCR. ☆27 Mar 19, 2024 Updated last year
- ☆22 Oct 4, 2024 Updated last year
- rmp data ranking ☆13 Nov 4, 2025 Updated 3 months ago
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution ☆58 Mar 4, 2025 Updated 11 months ago
- Code and models of paper "Chained Multi-stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection"… ☆27 Aug 10, 2018 Updated 7 years ago
- Official repository of the first-ranking solution for the UPAR2024 Challenge - Track 1. ☆28 Dec 26, 2023 Updated 2 years ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆69 May 31, 2024 Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ☆13 Jun 28, 2025 Updated 7 months ago
- ☆124 Jul 29, 2024 Updated last year
- ☆67 Dec 7, 2025 Updated 2 months ago
- ☆31 Jul 1, 2021 Updated 4 years ago
- An experiment to see if we can process G2 reviews to extract topics from reviews ☆10 Feb 5, 2024 Updated 2 years ago
- ☆18 Jun 10, 2025 Updated 8 months ago
- [ICCVW2025] V-RoAst: A New Dataset for Visual Road Assessment ☆11 Dec 17, 2025 Updated last month
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios ☆16 Oct 18, 2024 Updated last year
- [CVPR2024] BEVSee ☆78 Jul 8, 2024 Updated last year
- ☆36 Jul 1, 2024 Updated last year
- xKV: Cross-Layer SVD for KV-Cache Compression ☆43 Nov 30, 2025 Updated 2 months ago
- A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team. ☆74 Oct 14, 2024 Updated last year
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment ☆35 Jul 1, 2024 Updated last year
- ☆31 Jan 23, 2024 Updated 2 years ago
- [ICLR2025] Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want ☆94 Dec 1, 2025 Updated 2 months ago
- 🏆The 1st place solution of track3 (City-Scale Multi-Camera Vehicle Tracking) in the NVIDIA AI City Challenge at CVPR 2021 Workshop. ☆140 Jul 24, 2022 Updated 3 years ago