Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool
☆709Jun 10, 2026Updated this week
Alternatives and similar repositories for dingo
Users that are interested in dingo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An multi-agent design-to-code tool that generates production-ready React code with high visual fidelity and iterative validation.☆109May 22, 2026Updated 2 weeks ago
- Open-source multimodal data annotation platform with AI auto-annotation support.☆1,583Updated this week
- The Open-Source Data Annotation Platform☆1,235Feb 19, 2025Updated last year
- Data annotation component library --provided as NPM packages☆152Jun 2, 2026Updated last week
- conversion doc(pdf/html/doc/docx/ppt/pptx)to markdown☆49Jul 23, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆40May 28, 2025Updated last year
- SDK of OpenDataLab - https://opendatalab.org.cn☆60Jul 31, 2025Updated 10 months ago
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆24Dec 11, 2024Updated last year
- Our code for ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".☆124Feb 7, 2026Updated 4 months ago
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆1,795May 6, 2026Updated last month
- ☆23Nov 4, 2024Updated last year
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆46Dec 6, 2024Updated last year
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆7,075Updated this week
- Web archiving utility library☆11May 5, 2026Updated last month
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A Python package for interacting with the MinerU Vision-Language Model.☆128May 28, 2026Updated 2 weeks ago
- A lightweight framework for building LLM-based agents☆2,256Jun 4, 2026Updated last week
- Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷☆6,489Updated this week
- ☆19Oct 28, 2025Updated 7 months ago
- A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval☆14,405May 1, 2026Updated last month
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆7,894Updated this week
- Data Set Description Language Specification (新一代人工智能数据集描述语言DSDL)☆46May 29, 2024Updated 2 years ago
- [ACL 2025] Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL☆16Oct 9, 2025Updated 8 months ago
- LLM Group Chat Framework: chat with multiple LLMs at the same time. 大模型群聊框架:同时与多个大语言模型聊天。☆325Jun 19, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆13Apr 19, 2025Updated last year
- Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.☆66,927Updated this week
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…☆14,424Updated this week
- ☆12Sep 7, 2024Updated last year
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆14Apr 18, 2024Updated 2 years ago
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆7,216Oct 30, 2025Updated 7 months ago
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,696Jan 3, 2025Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆269Jul 8, 2025Updated 11 months ago
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆361Mar 22, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆861Mar 17, 2025Updated last year
- Official implementation of BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning.…☆49Apr 8, 2026Updated 2 months ago
- 百度QA100万数据集☆46Nov 30, 2023Updated 2 years ago
- Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025)☆15May 2, 2025Updated last year
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated last year
- 万卷1.0多模态语料☆574Oct 20, 2023Updated 2 years ago
- SQLynx Pro: Desktop and Web SQL Tool. Both web and desktop access. Support popular SQL databases like mysql, mariadb, postgresql, sqlite …☆30May 11, 2025Updated last year