datasets resource
☆146May 27, 2026Updated last month
Alternatives and similar repositories for opendatalab-datasets
Users that are interested in opendatalab-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data Set Description Language Specification (新一代人工智能数据集描述语言DSDL)☆46May 29, 2024Updated 2 years ago
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆14Apr 18, 2024Updated 2 years ago
- Open-source multimodal data annotation platform with AI auto-annotation support.☆1,605Jun 17, 2026Updated last week
- MLLM-DataEngine: An Iterative Refinement Approach for MLLM☆49May 24, 2024Updated 2 years ago
- LabelBee is an annotation Library☆302Jun 9, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆66Sep 6, 2024Updated last year
- The Open-Source Data Annotation Platform☆1,244Feb 19, 2025Updated last year
- Out-of-the-box Annotation Toolbox☆395Apr 19, 2024Updated 2 years ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆486Sep 28, 2025Updated 9 months ago
- (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆103Dec 3, 2025Updated 6 months ago
- 万卷1.0多模态语料☆574Oct 20, 2023Updated 2 years ago
- A Python package for interacting with the MinerU Vision-Language Model.☆131Jun 11, 2026Updated 2 weeks ago
- NanaDraw turns complex scientific ideas into clear, expressive visuals you can use right away. Powered by Nano Banana, it generates edita…☆102Apr 29, 2026Updated 2 months ago
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆24Dec 11, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,748Jan 3, 2025Updated last year
- ☆121Jan 15, 2026Updated 5 months ago
- Implementation of the Paper Scene-Graph ViT☆10Dec 20, 2024Updated last year
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,200Apr 14, 2025Updated last year
- Normal Learning in Videos with Attention Prototype Network☆18Jan 19, 2023Updated 3 years ago
- The official implementation of the paper "CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis"☆16Sep 2, 2024Updated last year
- Official implementation of Panacea: A foundation model for clinical trial design, recruitment, search, and summarization.☆21Dec 24, 2024Updated last year
- [ICLR2025] Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning☆14Apr 8, 2025Updated last year
- This is a repository for ACMMM22 paper "Exploring Effective Knowledge Transfer for Few-shot Object Detection"☆18Jun 21, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.☆71,142Updated this week
- ☆14Apr 19, 2024Updated 2 years ago
- 陆续开源医疗行业的深度学习模型及数据集☆13Dec 30, 2021Updated 4 years ago
- 公安网备 敏感词过滤词☆14Oct 7, 2018Updated 7 years ago
- vllm混合推理扩展插件,支持多NUMA混合推理,单卡推理Qwen3-Next模型可达1000+ prefill☆32Nov 7, 2025Updated 7 months ago
- GRPO Algorithm for Llava Architecture (Based on Verl)☆49May 9, 2025Updated last year
- 纯前端的 New API 调用测试页面,用来测试 OpenAI/Anthropic/Google 的一些特殊调用方式。所有数据仅在浏览器本地处理与保存。☆49Jan 29, 2026Updated 5 months ago
- ☆17Dec 13, 2023Updated 2 years ago
- ICDO: International Classification of Diseases Ontology☆12Apr 19, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Repository containing variety of model files and program scripts for Nvidia Isaac -enviroments☆16Feb 23, 2026Updated 4 months ago
- [ICLR 2025 Spotlight] The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multi…☆180Feb 7, 2026Updated 4 months ago
- ☆21Jul 20, 2024Updated last year
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆1,854Updated this week
- (CVPR 2026) TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition☆34Feb 5, 2026Updated 4 months ago
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆7,230Oct 30, 2025Updated 8 months ago
- Some Useful Tools Code☆16Updated this week