ConardLi/easy-dataset

A powerful tool for creating datasets for LLM fine-tuning, RAG, and Eval

☆14,691

Alternatives and similar repositories for easy-dataset

Users who are interested in easy-dataset are comparing it to the libraries listed below.

hiyouga / LlamaFactory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
☆73,496 · Updated this week
opendatalab / MinerU
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
☆75,630 · Updated this week
infiniflow / ragflow
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…
☆85,907 · Updated this week
langgenius / dify
Build Agentic workflows, RAG pipelines, with rich AI model and tool support on one collaborative workspace. Deploy on cloud, VPC, or self…
☆150,110 · Updated this week
jingyaogong / minimind
🧠 Train a 64M-parameter LLM from scratch in just 2h!
☆53,816 · Updated this week
unslothai / unsloth
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆68,832 · Updated this week
modelscope / ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…
☆14,937 · Updated this week
FoundationAgents / OpenManus
No fortress, purely open ground. OpenManus is Coming.
☆57,587 · Feb 11, 2026 · Updated 5 months ago
datawhalechina / self-llm
"A Guide to Consuming Open-Source LLMs": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying domestic and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment.
☆31,410 · Jul 15, 2026 · Updated last week
chatchat-space / Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain…
☆38,469 · Nov 10, 2025 · Updated 8 months ago
labring / FastGPT
FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data process…
☆29,109 · Updated this week
microsoft / graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
☆34,808 · Updated this week
HKUDS / LightRAG
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
☆38,086 · Updated this week
ConardLi / easy-learn-ai
Easy-to-understand AI learning resources for beginners.
☆1,338 · Jul 12, 2026 · Updated last week
bytedance / deer-flow
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, s…
☆77,785 · Updated this week
AiHubCN / Awesome-Chinese-LLM
A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials.
☆22,697 · May 10, 2026 · Updated 2 months ago
liguodongiot / llm-action
This project aims to share the technical principles behind large models and hands-on experience (LLM engineering and putting LLM applications into production).
☆24,800 · Updated this week
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆87,069 · Updated this week
zilliztech / deep-searcher
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
☆8,013 · Nov 19, 2025 · Updated 8 months ago
Alibaba-NLP / DeepResearch
Tongyi Deep Research, the Leading Open-source Deep Research Agent
☆19,714 · Feb 27, 2026 · Updated 4 months ago
xorbitsai / inference
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-p…
☆9,449 · Updated this week
svcvit / Awesome-Dify-Workflow
Sharing some useful Dify DSL workflows, suitable for both personal use and learning.
☆10,706 · Mar 25, 2026 · Updated 3 months ago
coze-dev / coze-studio
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. C…
☆21,239 · Apr 20, 2026 · Updated 3 months ago
kvcache-ai / ktransformers
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
☆18,995 · Updated this week
songquanpeng / one-api
LLM API management and distribution system supporting mainstream models including OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot (Wenxin Yiyan), iFlytek Spark, Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan, with unified API adaptation; can be used for key …
☆35,935 · Jan 9, 2026 · Updated 6 months ago
modelscope / evalscope
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
☆3,136 · Updated this week
1Panel-dev / MaxKB
🔥 MaxKB is a powerful, easy-to-use open-source platform for building enterprise-grade agents.
☆22,224 · Updated this week
CherryHQ / cherry-studio
AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs
☆48,947 · Updated this week
Tencent / WeKnora
Open-source LLM knowledge platform: turn raw documents into a queryable RAG, an autonomous reasoning agent, and a self-maintaining Wiki.
☆18,865 · Updated this week
QwenLM / Qwen-Agent
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
☆16,841 · Mar 4, 2026 · Updated 4 months ago
mem0ai / mem0
Universal memory layer for AI Agents
☆61,613 · Updated this week
netease-youdao / QAnything
Question and Answer based on Anything.
☆14,046 · Mar 24, 2025 · Updated last year
eosphoros-ai / DB-GPT
Open-source agentic AI data assistant for the next generation of AI + Data products.
☆19,555 · Updated this week
hiyouga / EasyR1
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
☆5,081 · Updated this week
Kiln-AI / Kiln
Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, a…
☆4,971 · Updated this week
xerrors / Yuxi
A multi-tenant Agent Harness platform combining knowledge base and knowledge graph management. An agent harness that integrates a LightRAG knowledge base and knowledge graphs. Build with LangChain…
☆6,251 · Updated this week
SwanHubX / SwanLab
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with …
☆4,081 · Updated this week
modelscope / FunASR
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,459 · Updated this week
punkpeye / awesome-mcp-servers
A collection of MCP servers.
☆91,349 · Updated this week