☆31Aug 29, 2025Updated 6 months ago
Alternatives and similar repositories for Awesome-VLM-Synthetic-Data
Users that are interested in Awesome-VLM-Synthetic-Data are comparing it to the libraries listed below
Sorting:
- ☆10Apr 7, 2025Updated 11 months ago
- ☆20Mar 8, 2026Updated last week
- Offical respority for Gait Recogniton with Drones: A benchmark (TMM 2023)☆10Feb 2, 2024Updated 2 years ago
- A Cross-Modal RGB-Event Benchmark for Multi-Object Tracking and Detection.☆12Oct 17, 2023Updated 2 years ago
- [World-Model-Survey-2024] Paper list and projects for World Model☆15Oct 31, 2024Updated last year
- [ICLR 2026 Oral] Generative Universal Verifier as Multimodal Meta-Reasoner☆54Nov 14, 2025Updated 4 months ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 11 months ago
- ☆80Nov 24, 2024Updated last year
- GaitParsing: Human Semantic Parsing for Gait Recognition (IEEE TMM)☆12May 20, 2024Updated last year
- LATTICE turns retrieval into an LLM-driven navigation problem over a semantic scaffold☆32Mar 9, 2026Updated last week
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin…☆43Oct 19, 2025Updated 5 months ago
- SOT画图工具(主要是LaSOT数据集),可以同时画多个跟踪器的跟踪框、蜘蛛网图、花花绿绿曲线图☆23Jan 10, 2025Updated last year
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆35Jun 12, 2025Updated 9 months ago
- A paper list of self-supervised pretrain method☆22Aug 15, 2025Updated 7 months ago
- Embedding model prioritized towards Multimodal RAG, overall + VisDoc double top1 on MMEB benchmark☆34Nov 6, 2025Updated 4 months ago
- ☆23Nov 4, 2024Updated last year
- Codebase from our first release.☆48Feb 17, 2026Updated last month
- [ACL2025] STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection☆45Oct 25, 2025Updated 4 months ago
- ☆38Jan 9, 2026Updated 2 months ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆32Mar 26, 2025Updated 11 months ago
- LandmarkGait: Intrinsic Human Parsing for Gait Recognition (ACM MM 2023)☆18Jun 13, 2024Updated last year
- GaitFormer Official Codebase for the paper "Learning Gait Representations with Noisy Multi-Task Learning"☆20Feb 22, 2023Updated 3 years ago
- ☆11Jan 27, 2020Updated 6 years ago
- Official implementation of NeRFProtector [ECCV'24]☆22Aug 27, 2024Updated last year
- [TPAMI] Locating and Counting Heads in Crowds With a Depth Prior☆10Jan 7, 2022Updated 4 years ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Feb 5, 2024Updated 2 years ago
- 🚀 Codebase and Fondation Models for Visual Instruction Tuning☆14Aug 19, 2023Updated 2 years ago
- [NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes☆12Jun 12, 2023Updated 2 years ago
- Code for the paper "Finetuning CLIP to Reason about Pairwise Differences"☆19Oct 1, 2024Updated last year
- The implementation of our CVPR 2023 paper: Frame-Event Alignment and Fusion Network for High Frame Rate Tracking☆34May 29, 2023Updated 2 years ago
- ☆52Dec 13, 2024Updated last year
- FreeVA: Offline MLLM as Training-Free Video Assistant☆69Jun 9, 2024Updated last year
- Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations"☆15Feb 8, 2024Updated 2 years ago
- ☆19Apr 28, 2023Updated 2 years ago
- ☆29Oct 3, 2023Updated 2 years ago
- ☆22Jul 5, 2025Updated 8 months ago
- ☆26Jul 10, 2025Updated 8 months ago
- ☆19Jul 23, 2024Updated last year
- ☆19Feb 12, 2025Updated last year