MigoXLab/dingo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MigoXLab/dingo)

MigoXLab / dingo

Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool

☆728

Alternatives and similar repositories for dingo

Users that are interested in dingo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ccprocessor / web2json-agent
View on GitHub
Web Structured Data Extraction Agent
☆16Mar 10, 2026Updated 4 months ago
MigoXLab / LMeterX
View on GitHub
A general-purpose API load testing platform that supports LLM services and business HTTP interfaces, enabling one-click performance testi…
☆198Jul 10, 2026Updated last week
opendatalab / WebMainBench
View on GitHub
WebMainBench is a high-precision benchmark for evaluating web main content extraction.
☆19Jun 13, 2026Updated last month
MigoXLab / webqa-agent
View on GitHub
Autonomous web browser agent that audits performance, functionality & UX for engineers and vibe-coding creators. 网站自主评估测试 Agent，支持 GUI/CL…
☆223Jul 2, 2026Updated 2 weeks ago
opendatalab / MinerU-HTML
View on GitHub
MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep Research Agents, RAG application…
☆277Mar 27, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
opendatalab / LabelLLM
View on GitHub
The Open-Source Data Annotation Platform
☆1,262Jul 2, 2026Updated 2 weeks ago
opendatalab / Meta-rater
View on GitHub
[ACL 2025 Best Theme Paper] This is the official implementation for the paper: "Meta-rater: A Multi-dimensional Data Selection Method for…
☆196Aug 29, 2025Updated 10 months ago
opendatalab / labelU
View on GitHub
Open-source multimodal data annotation platform with AI auto-annotation support.
☆1,631Updated this week
datajuicer / data-juicer
View on GitHub
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
☆6,752Updated this week
OpenDataArena / OpenDataArena-Tool
View on GitHub
Tools for OpenDataArena: Fair, Open, and Transparent Arena for Data
☆146Mar 15, 2026Updated 4 months ago
opendatalab / WanJuan3.0
View on GitHub
WanJuan3.0（“万卷·丝路”）一个作为综合性的纯文本语料库，采集了多个国家地区的网络公开信息、文献、专利等资料，数据总规模超1.2TB，Token总数超过300B，处于国际领先水平，首期开源的语料库主要由泰语、俄语、阿拉伯语、韩语和越南语5个子集构成，每个子集的数据…
☆47Feb 13, 2025Updated last year
opendatalab / OmniDocBench
View on GitHub
[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
☆1,900Jun 26, 2026Updated 3 weeks ago
modelscope / ms-swift
View on GitHub
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…
☆14,887Updated this week
opendatalab / opendatalab-python-sdk
View on GitHub
SDK of OpenDataLab - https://opendatalab.org.cn
☆60Jul 31, 2025Updated 11 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
InternLM / Condor
View on GitHub
[ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement
☆40May 28, 2025Updated last year
ConardLi / easy-dataset
View on GitHub
A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval
☆14,673May 1, 2026Updated 2 months ago
open-compass / opencompass
View on GitHub
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …
☆7,221Updated this week
modelscope / evalscope
View on GitHub
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
☆3,115Updated this week
opendatalab / labelU-Kit
View on GitHub
Data annotation component library --provided as NPM packages
☆158Updated this week
opendatalab / MinerU
View on GitHub
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
☆75,311Updated this week
OpenDCAI / DataFlow
View on GitHub
Easy Data Preparation with latest LLMs-based Operators and Pipelines.
☆6,693Updated this week
pengr / DataMan
View on GitHub
Our code for ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".
☆130Feb 7, 2026Updated 5 months ago
OpenDataBox / awesome-data-llm
View on GitHub
Official Repository of "LLM × DATA" Survey Paper
☆804Jun 15, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
UltraData-OpenBMB / UltraData-Math
View on GitHub
☆30Apr 15, 2026Updated 3 months ago
SHUzhangshuo / ArborVista
View on GitHub
基于 MinerU 的智能论文阅读助手，提供 PDF 文档解析、OCR 识别、表格提取等功能。
☆19Dec 2, 2025Updated 7 months ago
opendatalab / labelbee
View on GitHub
☆25Nov 7, 2022Updated 3 years ago
opendatalab / Vis3
View on GitHub
Data browser based on s3. 一个基于 S3 的数据（json / jsonl / parquet / html / md等）可视化工具。👇 Try online.
☆89Apr 14, 2026Updated 3 months ago
alibaba / ROLL
View on GitHub
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
☆3,313Updated this week
opendatalab / dsdl-docs
View on GitHub
Data Set Description Language Specification （新一代人工智能数据集描述语言DSDL）
☆46May 29, 2024Updated 2 years ago
QwenLM / Qwen-Agent
View on GitHub
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
☆16,821Mar 4, 2026Updated 4 months ago
verl-project / verl
View on GitHub
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,587Updated this week
hiyouga / EasyR1
View on GitHub
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
☆5,074Updated this week
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
InternLM / lmdeploy
View on GitHub
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
☆7,969Updated this week
InternLM / lagent
View on GitHub
A lightweight framework for building LLM-based agents
☆2,270Jul 6, 2026Updated 2 weeks ago
Alibaba-NLP / DeepResearch
View on GitHub
Tongyi Deep Research, the Leading Open-source Deep Research Agent
☆19,691Feb 27, 2026Updated 4 months ago
areal-project / AReaL
View on GitHub
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
☆5,579Updated this week
OpenDCAI / Flash-MinerU
View on GitHub
Ray-powered accelerator for MinerU, turning PDF → Markdown into a scalable, cluster-ready data infrastructure. 基于 Ray 的 MinerU 加速层，将 PDF …
☆63Apr 20, 2026Updated 3 months ago
opendatalab / Miner-PDF-Benchmark
View on GitHub
MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.
☆24Dec 11, 2024Updated last year
opendatalab / PDF-Extract-Kit
View on GitHub
A Comprehensive Toolkit for High-Quality PDF Content Extraction
☆9,797Jan 3, 2025Updated last year