☆22Sep 9, 2025Updated 6 months ago
Alternatives and similar repositories for SparkUI-Parser
Users that are interested in SparkUI-Parser are comparing it to the libraries listed below
Sorting:
- ChineseCLIP using online learning☆14Nov 7, 2022Updated 3 years ago
- Awesome multi-modal large language paper/project, collections of popular training strategies, e.g., PEFT, LoRA.☆27Aug 2, 2024Updated last year
- ☆12Sep 19, 2021Updated 4 years ago
- ☆12Jan 10, 2025Updated last year
- A digital twin of the city of Chicago along with automated sensors☆13Nov 14, 2019Updated 6 years ago
- Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".☆18Jan 30, 2024Updated 2 years ago
- NightSurveillance Sataset for Pedestrian Detection☆11Jul 30, 2020Updated 5 years ago
- trackets-level person attributes on the MARS dataset☆14Aug 14, 2019Updated 6 years ago
- MXNet-Gluon model to Caffe (support SSD in gluoncv)☆10Jun 20, 2019Updated 6 years ago
- Multi-Person Tracking in Tour Guide Robot☆10Aug 23, 2022Updated 3 years ago
- ☆12Sep 14, 2020Updated 5 years ago
- ☆24Jun 12, 2025Updated 9 months ago
- "FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding", NeurIPS 2023 Datasets and Benchmarks Track☆12Jun 20, 2024Updated last year
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆20Jan 26, 2025Updated last year
- ☆11May 31, 2020Updated 5 years ago
- A curated list of resources dedicated to computer vision and related algorithms for creating, correcting maps. Feel free to make PRs to c…☆13Jan 3, 2019Updated 7 years ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆20Oct 17, 2024Updated last year
- yolact, fcos, gluoncv☆14Nov 28, 2022Updated 3 years ago
- [CVPR'2022 Oral] The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation☆32Oct 19, 2023Updated 2 years ago
- (ACL 2025) MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale☆49Jun 4, 2025Updated 9 months ago
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception☆153Jan 10, 2026Updated 2 months ago
- The application of large pre-trained vision model DINOv2 from MetaAI for feature points matching, and a ViT decoder used for Auto Encoder☆17Apr 27, 2023Updated 2 years ago
- AdaCrowd: Unlabeled Scene Adaptation for Crowd Counting (TMM 2021)☆10Feb 24, 2021Updated 5 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- Hierarchical And Quantized AutoEncoders☆13Jun 12, 2020Updated 5 years ago
- Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…☆100Sep 8, 2025Updated 6 months ago
- ☆66Feb 1, 2026Updated last month
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- ☆14Oct 14, 2021Updated 4 years ago
- Code for CVPR 2024 paper: ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis☆35Apr 29, 2025Updated 10 months ago
- Code for EDT's IGVC entry, Revo.☆19Sep 23, 2017Updated 8 years ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆30Oct 2, 2025Updated 5 months ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆23Dec 4, 2024Updated last year
- The C++ implementation of Multi-H algorithm, which is a multi-plane fitting technique. If you use this work for Academic purposes, pleas…☆32Feb 26, 2019Updated 7 years ago
- ☆24Jul 31, 2024Updated last year
- ☆18Jun 10, 2025Updated 9 months ago
- [AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhi…☆28Jul 29, 2024Updated last year
- [AAAI 2025] Official code for "OmniCount: Multi-label Object Counting with Semantic-Geometric Priors"☆21Sep 30, 2025Updated 5 months ago
- Implementation of "Single Shot Video Object Detector"☆23Mar 25, 2020Updated 5 years ago