Data Set Description Language Specification (新一代人工智能数据集描述语言DSDL)
☆46May 29, 2024Updated last year
Alternatives and similar repositories for dsdl-docs
Users that are interested in dsdl-docs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆25Nov 7, 2022Updated 3 years ago
- Data annotation component library --provided as NPM packages☆147Updated this week
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆13Apr 18, 2024Updated last year
- Out-of-the-box Annotation Toolbox☆395Apr 19, 2024Updated last year
- WanJuan3.0(“万卷·丝路”)一个作为综合性的纯文本语料库,采集了多个国家地区的网络公开信息、文献、专利等资料,数据总规模超1.2TB,Token总数超过300B,处于国际领先水平,首期开源的语料库主要由泰语、俄语、阿拉伯语、韩语和越南语5个子集构成,每个子集的数据…☆43Feb 13, 2025Updated last year
- 万卷1.0多模态语料☆570Oct 20, 2023Updated 2 years ago
- The Open-Source Data Annotation Platform☆1,194Feb 19, 2025Updated last year
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆459Sep 28, 2025Updated 5 months ago
- (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆94Dec 3, 2025Updated 3 months ago
- A Python package for interacting with the MinerU Vision-Language Model.☆109Feb 5, 2026Updated last month
- ☆120Jan 15, 2026Updated 2 months ago
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis☆16Jan 13, 2022Updated 4 years ago
- ☆14Apr 19, 2024Updated last year
- [ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"☆75Oct 22, 2025Updated 5 months ago
- A PyTorch implementation of Cyclical Learning Rates☆25Jan 30, 2018Updated 8 years ago
- [ICLR 2025 Spotlight] The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multi…☆176Feb 7, 2026Updated last month
- TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition☆26Feb 5, 2026Updated last month
- CMake configurations for PPL projects☆12Aug 10, 2024Updated last year
- Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization☆100Jan 30, 2024Updated 2 years ago
- Paper Reading:涉及分布式、虚拟化、网络、机器学习☆23Sep 27, 2020Updated 5 years ago
- [NeurIPS 2025 🔥] FakeVLM: Advancing Synthetic Image Detection through Explainable Multimodal Models and Fine-Grained Artifact Analysis☆126Sep 24, 2025Updated 5 months ago
- ☆37Oct 29, 2024Updated last year
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,059Apr 14, 2025Updated 11 months ago
- ☆10Oct 8, 2021Updated 4 years ago
- Fibertree emulator☆17Nov 4, 2024Updated last year
- Stencil with Optimized Dataflow Architecture☆12Feb 27, 2024Updated 2 years ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆12Apr 17, 2023Updated 2 years ago
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"☆47Jun 3, 2025Updated 9 months ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated 11 months ago
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆359Mar 22, 2024Updated 2 years ago
- A DAG processor and compiler for a tree-based spatial datapath.☆16Aug 24, 2022Updated 3 years ago
- ☆15Mar 21, 2025Updated last year
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆26Jan 16, 2023Updated 3 years ago
- ☆901Jun 7, 2023Updated 2 years ago
- InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions☆2,923May 26, 2025Updated 9 months ago
- ☆528Mar 13, 2025Updated last year
- Common libraries for PPL projects☆31Mar 10, 2025Updated last year
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆7,172Oct 30, 2025Updated 4 months ago