opendatalab / opendatalab-datasetsLinks
datasets resource
☆117Updated 2 months ago
Alternatives and similar repositories for opendatalab-datasets
Users that are interested in opendatalab-datasets are comparing it to the libraries listed below
Sorting:
- Data Set Description Language Specification (新一代人工智能数据集描述语言DSDL)☆47Updated last year
- Data annotation component library --provided as NPM packages☆106Updated this week
- SDK of OpenDataLab - https://opendatalab.org.cn☆57Updated last year
- ☆25Updated 2 years ago
- The Open-Source Data Annotation Platform☆848Updated 4 months ago
- Data annotation toolbox supports image, audio and video data.☆1,235Updated this week
- 万卷1.0多模态语料☆561Updated last year
- ☆523Updated 11 months ago
- Dingo: A Comprehensive AI Data Quality Evaluation Tool☆183Updated last week
- ☆18Updated last week
- GOT的vLLM加速实现 并结合 MinerU 实现RAG中的pdf 解析☆58Updated 7 months ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆290Updated 9 months ago
- AAAI 2024: Visual Instruction Generation and Correction☆93Updated last year
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆348Updated last year
- ☆338Updated last year
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆526Updated last month
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆354Updated last week
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆225Updated 2 months ago
- ☆229Updated last year
- WanJuan3.0(“万卷·丝路”)一个作为综合性的纯文本语料库,采集了多个国家地区的网络公开信息、文献、专利等资料,数据总规模超1.2TB,Token总数超过300B,处于国际领先水平,首期开源的语料库主要由泰语、俄语、阿拉伯语、韩语和越南语5个子集构成,每个子集的数据…☆30Updated 4 months ago
- ☆63Updated last year
- Analysis of Chinese and English layouts 中英文版面分析☆218Updated last week
- LLM Group Chat Framework: chat with multiple LLMs at the same time. 大模型群聊框架:同时与多个大语言模型聊天。☆304Updated last week
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆23Updated 6 months ago
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆500Updated 2 weeks ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆246Updated 6 months ago
- Enhance LLM agents with rich tool APIs☆390Updated 9 months ago
- 文档方向分类☆219Updated 7 months ago
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆314Updated this week
- 顾名思义:手搓的RAG☆124Updated last year