pany8125 / ShareGPTQAExtractor-mnbvc
MNBVC项目-ShareGPT语料清洗
☆12Updated last year
Alternatives and similar repositories for ShareGPTQAExtractor-mnbvc:
Users that are interested in ShareGPTQAExtractor-mnbvc are comparing it to the libraries listed below
- Evaluation for AI apps and agent☆36Updated last year
- 大语言模型训练和服务调研☆35Updated last year
- ☆24Updated 3 months ago
- GoGPT中文指令数据集构造☆10Updated 11 months ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆23Updated 6 months ago
- LLM+RAG for QA☆21Updated last year
- 用于微调LLM的中文指令数据集☆27Updated last year
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆45Updated 7 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 9 months ago
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆54Updated last year
- Imitate OpenAI with Local Models☆85Updated 4 months ago
- ☆9Updated last year
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆14Updated last year
- Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理,可实现 CPU 上毫秒级的 OCR 精准预测,通用场景中英文OCR达到开源SO…☆52Updated 3 weeks ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆64Updated last year
- 该项目主要是抽取病历文件中的一些关键信息。并将抽取的内容进行streamlit前端的展示。目前支持的文件类型:图片,pdf文件,word文件☆23Updated 2 years ago
- aigc evals☆10Updated last year
- accelerate generating vector by using onnx model☆13Updated 11 months ago
- 有一个通用实体关系事件抽取的任务,需要使用到UIE模框架,而且需要将起部署到昇腾310服务器上,因为UIE模型底层使用的是ernie3.0,但是目前paddle官方还不支持ernie3.0模型在昇腾310上部署,所以才有了以下的操作,主要过程是,先试用paddle训练处模型…☆17Updated 2 years ago
- 通用简单工具项目☆15Updated 3 months ago
- (撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。☆31Updated 5 months ago
- LLM RAG 应用,支持 API 调用,语音交互。☆10Updated 6 months ago
- ☆15Updated 6 months ago
- ☆38Updated last year
- Large-scale exact string matching tool☆15Updated 2 months ago
- ☆44Updated 7 months ago
- ☆27Updated last year