Data Set Description Language Specification (新一代人工智能数据集描述语言DSDL)
☆46May 29, 2024Updated 2 years ago
Alternatives and similar repositories for dsdl-docs
Users that are interested in dsdl-docs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SDK of OpenDataLab - https://opendatalab.org.cn☆60Jul 31, 2025Updated 11 months ago
- ☆25Nov 7, 2022Updated 3 years ago
- datasets resource☆146May 27, 2026Updated last month
- Data annotation component library --provided as NPM packages☆156Jun 2, 2026Updated 3 weeks ago
- AAAI 2024: Visual Instruction Generation and Correction☆97Feb 4, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆14Apr 18, 2024Updated 2 years ago
- MLLM-DataEngine: An Iterative Refinement Approach for MLLM☆49May 24, 2024Updated 2 years ago
- [ACL 2024 Main Conference] Chinese commonsense benchmark for LLMs☆45Jul 27, 2024Updated last year
- Out-of-the-box Annotation Toolbox☆395Apr 19, 2024Updated 2 years ago
- [ACL 2025 Best Theme Paper] This is the official implementation for the paper: "Meta-rater: A Multi-dimensional Data Selection Method for…☆195Aug 29, 2025Updated 10 months ago
- Open-source multimodal data annotation platform with AI auto-annotation support.☆1,605Jun 17, 2026Updated 2 weeks ago
- WanJuan3.0(“万卷·丝路”)一个作为 综合性的纯文本语料库,采集了多个国家地区的网络公开信息、文献、专利等资料,数据总规模超1.2TB,Token总数超过300B,处于国际领先水平,首期开源的语料库主要由泰语、俄语、阿拉伯语、韩语和越南语5个子集构成,每个子集的数据…☆46Feb 13, 2025Updated last year
- 万卷1.0多模态语料☆574Oct 20, 2023Updated 2 years ago
- A Python package for interacting with the MinerU Vision-Language Model.☆132Jun 11, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆121Jan 15, 2026Updated 5 months ago
- NanaDraw turns complex scientific ideas into clear, expressive visuals you can use right away. Powered by Nano Banana, it generates edita…☆102Apr 29, 2026Updated 2 months ago
- ☆14Apr 19, 2024Updated 2 years ago
- [ICLR 2025 Spotlight] The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multi…☆181Feb 7, 2026Updated 4 months ago
- (CVPR 2026) TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition☆34Feb 5, 2026Updated 4 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆310Apr 3, 2024Updated 2 years ago
- The official pytorch implementation of Exploring the User Guidance for More Accurate Building Segmentation from High-Resolution Remote Se…☆18May 27, 2024Updated 2 years ago
- Paper Reading:涉及分布式、虚拟化、网络、机器学习☆22Sep 27, 2020Updated 5 years ago
- [NeurIPS 2025 🔥] FakeVLM: Advancing Synthetic Image Detection through Explainable Multimodal Models and Fine-Grained Artifact Analysis☆152Sep 24, 2025Updated 9 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆10Oct 8, 2021Updated 4 years ago
- Fibertree emulator☆17Nov 4, 2024Updated last year
- [NeurIPS'22] Learning from Future: A Novel Self-Training Framework for Semantic Segmentation.☆32Sep 22, 2022Updated 3 years ago
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,757Jan 3, 2025Updated last year
- Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool☆719Updated this week
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"☆51Jun 3, 2025Updated last year
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆361Mar 22, 2024Updated 2 years ago
- Text-to-3D Generation within 5 Minutes☆730Mar 10, 2024Updated 2 years ago
- A DAG processor and compiler for a tree-based spatial datapath.☆16Aug 24, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- An multi-agent design-to-code tool that generates production-ready React code with high visual fidelity and iterative validation.☆109May 22, 2026Updated last month
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆16Jan 3, 2022Updated 4 years ago
- ☆15Mar 21, 2025Updated last year
- A multi-task(detection, tracking, dense estimation, object counting) frame-work based on yolov5+deepsort☆37Aug 6, 2021Updated 4 years ago
- ☆897Jun 7, 2023Updated 3 years ago
- This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.☆48Aug 22, 2025Updated 10 months ago
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago