Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool
☆673Apr 3, 2026Updated last week
Alternatives and similar repositories for dingo
Users that are interested in dingo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data annotation toolbox supports image, audio and video data.☆1,539Mar 20, 2026Updated 3 weeks ago
- The Open-Source Data Annotation Platform☆1,207Feb 19, 2025Updated last year
- WanJuan3.0(“万卷·丝路”)一个作为综合性的纯文本语料库,采集了多个国家地区的网络公开信息、文献、专利等资料,数据总规模超1.2TB,Token总数超过300B,处于国际领先水平,首期开源的语料库主要由泰语、俄语、阿拉伯语、韩语和越南语5个子集构成,每个子集的数据…☆44Feb 13, 2025Updated last year
- Data annotation component library --provided as NPM packages☆147Mar 18, 2026Updated 3 weeks ago
- conversion doc(pdf/html/doc/docx/ppt/pptx)to markdown☆48Jul 23, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- [ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆39May 28, 2025Updated 10 months ago
- SDK of OpenDataLab - https://opendatalab.org.cn☆59Jul 31, 2025Updated 8 months ago
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆24Dec 11, 2024Updated last year
- Our code for ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".☆122Feb 7, 2026Updated 2 months ago
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆1,630Feb 27, 2026Updated last month
- AAAI 2024: Visual Instruction Generation and Correction☆96Feb 4, 2024Updated 2 years ago
- A Python package for interacting with the MinerU Vision-Language Model.☆109Updated this week
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆6,833Mar 30, 2026Updated last week
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆46Dec 6, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Web archiving utility library☆11Mar 11, 2026Updated 3 weeks ago
- A lightweight framework for building LLM-based agents☆2,231Mar 24, 2026Updated 2 weeks ago
- Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷☆6,234Updated this week
- ☆19Oct 28, 2025Updated 5 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆7,755Apr 4, 2026Updated last week
- A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval☆13,864Updated this week
- [ACL 2025] Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL☆15Oct 9, 2025Updated 6 months ago
- Data Set Description Language Specification (新一代人工智能数据集描述语言DSDL)☆46May 29, 2024Updated last year
- LLM Group Chat Framework: chat with multiple LLMs at the same time. 大模型群聊框架:同时与多个大语言模型聊天。☆323Jun 19, 2025Updated 9 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13Apr 19, 2025Updated 11 months ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆58,131Apr 3, 2026Updated last week
- Easy Data Preparation with latest LLMs-based Operators and Pipelines.☆3,194Mar 28, 2026Updated 2 weeks ago
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, …☆13,516Apr 3, 2026Updated last week
- ☆12Sep 7, 2024Updated last year
- A unified evaluation library for multiple machine learning libraries☆269Mar 29, 2024Updated 2 years ago
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆7,184Oct 30, 2025Updated 5 months ago
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,562Jan 3, 2025Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆269Jul 8, 2025Updated 9 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆359Mar 22, 2024Updated 2 years ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆841Mar 17, 2025Updated last year
- 百度QA100万数据集☆45Nov 30, 2023Updated 2 years ago
- Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025)☆15May 2, 2025Updated 11 months ago
- 万卷1.0多模态语料☆571Oct 20, 2023Updated 2 years ago
- SQLynx Pro: Desktop and Web SQL Tool. Both web and desktop access. Support popular SQL databases like mysql, mariadb, postgresql, sqlite …☆30May 11, 2025Updated 10 months ago
- A Next-Generation Training Engine Built for Ultra-Large MoE Models☆5,118Updated this week