☆35Aug 29, 2025Updated 8 months ago
Alternatives and similar repositories for Awesome-VLM-Synthetic-Data
Users that are interested in Awesome-VLM-Synthetic-Data are comparing it to the libraries listed below.
- ☆10Apr 7, 2025Updated last year
- Official repository for Gait Recognition with Drones: A benchmark (TMM 2023)☆10Feb 2, 2024Updated 2 years ago
- [World-Model-Survey-2024] Paper list and projects for World Model☆15Oct 31, 2024Updated last year
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆12Apr 18, 2025Updated last year
- [ICLR 2026 Oral] Generative Universal Verifier as Multimodal Meta-Reasoner☆57Nov 14, 2025Updated 5 months ago
- This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.☆17Sep 2, 2025Updated 7 months ago
- OpenMediation SDK Server☆15Oct 4, 2022Updated 3 years ago
- ☆25Dec 23, 2024Updated last year
- Official implementation of Geometry Cloak [NeurIPS'24]☆24Apr 16, 2025Updated last year
- Unsupervised Domain Adaptation on Graphs☆15Apr 6, 2022Updated 4 years ago
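- SOT plotting tool (mainly for the LaSOT dataset) that can draw multiple trackers' bounding boxes, spider (radar) charts, and colorful curve plots at the same time☆23Jan 10, 2025Updated last year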
- ☆37Jun 9, 2025Updated 10 months ago
- A paper list of self-supervised pretraining methods☆22Aug 15, 2025Updated 8 months ago
- Embedding model prioritized towards Multimodal RAG, overall + VisDoc double top1 on MMEB benchmark☆35Nov 6, 2025Updated 5 months ago
- A Population Based Reinforcement Learning Library based on PyTorch☆27Mar 5, 2023Updated 3 years ago
- [ICML 2023] Optimizing the Collaboration Structure in Cross-Silo Federated Learning. Wenxuan Bao, Haohan Wang, Jun Wu, Jingrui He.☆20Jul 25, 2023Updated 2 years ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆39Dec 24, 2025Updated 4 months ago
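- ✨ PotPlayer AI subtitle translation plugin - no more worrying about translation while watching shows; your professional translator 🎯 Powered by cutting-edge tech: • 🤖 Connects to top AI (OpenAI/DeepSeek/Tongyi Qianwen) - smart translation starts here • 🎬 8 specialized modes - anime, American comics, sci-fi, drama... each one accurate • 💬 Colloquial…☆65Updated this week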
- Grounding Language Models for Compositional and Spatial Reasoning☆18Oct 26, 2022Updated 3 years ago
- A simple LaTeX template for response letters☆20May 27, 2024Updated last year
- ☆23Nov 4, 2024Updated last year
- [ACL2025] STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection☆48Oct 25, 2025Updated 6 months ago
- GaitFormer Official Codebase for the paper "Learning Gait Representations with Noisy Multi-Task Learning"☆20Feb 22, 2023Updated 3 years ago
- [ACM MM 2023] LandmarkGait: Intrinsic Human Parsing for Gait Recognition☆18Jun 13, 2024Updated last year
- ☆11Jan 27, 2020Updated 6 years ago
- Official implementation of NeRFProtector [ECCV'24]☆22Aug 27, 2024Updated last year
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- Scaling Agentic Environments Automatically.☆62Mar 26, 2026Updated last month
- [AAAI 2024] QAGait: Revisit Gait Recognition From a Quality Perspective☆23Aug 26, 2024Updated last year
- Models and programs developed as part of XTX Forecasting Challenge 2019☆28Jul 6, 2023Updated 2 years ago
- [NeurIPS 2024] Official repository for downloading and using LAVIB☆25Aug 6, 2025Updated 8 months ago
- [TPAMI] Locating and Counting Heads in Crowds With a Depth Prior☆10Jan 7, 2022Updated 4 years ago
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆36Jul 3, 2025Updated 9 months ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Feb 5, 2024Updated 2 years ago
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆33Apr 3, 2026Updated 3 weeks ago
- FreeVA: Offline MLLM as Training-Free Video Assistant☆69Jun 9, 2024Updated last year
- Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations"☆15Feb 8, 2024Updated 2 years ago
- The implementation of our CVPR 2023 paper: Frame-Event Alignment and Fusion Network for High Frame Rate Tracking☆34May 29, 2023Updated 2 years ago
- ☆19Apr 28, 2023Updated 3 years ago