DataEval / dingo
Dingo: A Comprehensive Data Quality Evaluation Tool
☆28Updated this week
Alternatives and similar repositories for dingo:
Users who are interested in dingo are comparing it to the libraries listed below.
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆337Updated 9 months ago
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆20Updated last month
- ☆162Updated last month
- The Open-Source Data Annotation Platform☆632Updated last week
- ☆14Updated 6 months ago
- Xtuner Factory☆32Updated 10 months ago
- AAAI 2024: Visual Instruction Generation and Correction☆91Updated 11 months ago
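- A role-playing multi-LLM chat room fine-tuned from InternLM2, built on the original text of Journey to the West, its vernacular translation, and ChatGPT-generated data. This project covers everything about role-playing LLMs, from data acquisition and data processing, to fine-tuning with XTuner and deploying to OpenXLab, to deploying with LMDeploy, with op…☆90Updated 9 months ago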
- Applications of large language models and multimodal models, mainly covering RAG, small models, Agent, cross-modal search, OCR, and more☆144Updated 2 months ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆135Updated 7 months ago
- Enhance LLM agents with rich tool APIs☆364Updated 4 months ago
- datasets resource☆100Updated 5 months ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆213Updated last month
- ☆56Updated 11 months ago
- LLM Group Chat Framework: chat with multiple LLMs at the same time.☆261Updated 9 months ago
- As the name suggests: a hand-built RAG☆116Updated 10 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆178Updated this week
- ☆55Updated 10 months ago
- A vLLM-accelerated implementation of GOT, combined with MinerU for PDF parsing in RAG☆40Updated 2 months ago
- ☆49Updated last year
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆256Updated 9 months ago
- Vary-tiny codebase upon LAVIS (for training from scratch) and PDF image-text pair data (about 600k, including English/Chinese)☆76Updated 3 months ago
- Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊☆258Updated 2 months ago
- ☆81Updated 5 months ago
- [ArXiv] PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆108Updated 3 months ago
- WanJuan 1.0 multimodal corpus☆555Updated last year
- ☆104Updated last year
- ☆34Updated 3 months ago
- Train a LLaVA model with better Chinese support, and open-source the training code and data.☆44Updated 4 months ago
- To try textin document parsing, visit https://cc.co/16YSIy☆70Updated 2 months ago