Data Set Description Language Specification (DSDL, a new-generation AI dataset description language)
☆46May 29, 2024Updated last year
Alternatives and similar repositories for dsdl-docs
Users that are interested in dsdl-docs are comparing it to the libraries listed below.
- ☆25Nov 7, 2022Updated 3 years ago
- datasets resource☆134Jul 1, 2025Updated 9 months ago
- Data annotation component library --provided as NPM packages☆147Mar 18, 2026Updated 3 weeks ago
- AAAI 2024: Visual Instruction Generation and Correction☆96Feb 4, 2024Updated 2 years ago
- WanJuan-CC is a high-quality dataset built from CommonCrawl through data extraction, rule-based cleaning, deduplication, safety filtering, and quality cleaning.☆13Apr 18, 2024Updated last year
- [ACL 2024 Main Conference] Chinese commonsense benchmark for LLMs☆45Jul 27, 2024Updated last year
- Out-of-the-box Annotation Toolbox☆396Apr 19, 2024Updated last year
- LabelBee is an annotation Library☆300Mar 27, 2026Updated 2 weeks ago
- Data annotation toolbox supports image, audio and video data.☆1,539Mar 20, 2026Updated 3 weeks ago
- WanJuan 1.0 multimodal corpus☆571Oct 20, 2023Updated 2 years ago
- The Open-Source Data Annotation Platform☆1,207Feb 19, 2025Updated last year
- (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆95Dec 3, 2025Updated 4 months ago
- A Python package for interacting with the MinerU Vision-Language Model.☆109Updated this week
- ☆14Apr 19, 2024Updated last year
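- Summary of fundamentals and interview questions for C++ development, machine learning, deep learning, and recommendation algorithms☆21Mar 4, 2021Updated 5 years ago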
- [ICLR 2025 Spotlight] The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multi…☆176Feb 7, 2026Updated 2 months ago
- TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition☆30Feb 5, 2026Updated 2 months ago
- Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization☆101Jan 30, 2024Updated 2 years ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆306Apr 3, 2024Updated 2 years ago
- Paper Reading: covering distributed systems, virtualization, networking, and machine learning☆23Sep 27, 2020Updated 5 years ago
- ☆38Oct 29, 2024Updated last year
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,099Apr 14, 2025Updated 11 months ago
- Fibertree emulator☆17Nov 4, 2024Updated last year
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,562Jan 3, 2025Updated last year
- Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool☆673Apr 3, 2026Updated last week
- The code for Joint Neural Architecture Search and Quantization☆14Apr 10, 2019Updated 7 years ago
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆1,630Feb 27, 2026Updated last month
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆359Mar 22, 2024Updated 2 years ago
- Text-to-3D Generation within 5 Minutes☆731Mar 10, 2024Updated 2 years ago
- A DAG processor and compiler for a tree-based spatial datapath.☆16Aug 24, 2022Updated 3 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆16Jan 3, 2022Updated 4 years ago
- Cuckoo Hash – Comprehensive support in Go with no dependencies☆30Mar 20, 2021Updated 5 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆26Jan 16, 2023Updated 3 years ago
- GraphQL application using spring 5 reactive framework (webflux)☆45Mar 16, 2018Updated 8 years ago
- VisualGPTScore for visio-linguistic reasoning☆27Oct 7, 2023Updated 2 years ago
- InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions☆2,923May 26, 2025Updated 10 months ago
- This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.☆47Aug 22, 2025Updated 7 months ago
- ☆29Sep 17, 2024Updated last year