jayhenry / pdf2txt_mnbvc
☆40Updated last year
Alternatives and similar repositories for pdf2txt_mnbvc:
Users that are interested in pdf2txt_mnbvc are comparing it to the libraries listed below
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆45Updated 2 months ago
- TianGong-AI-Unstructure☆62Updated this week
- 国内首个全参数训练的法律大模型 HanFei-1.0 (韩非)☆114Updated last year
- 文本去重☆69Updated 10 months ago
- ☆63Updated 6 months ago
- Imitate OpenAI with Local Models☆88Updated 6 months ago
- ChatGPT WebUI using gradio. 给 LLM 对话和检索知识问答RAG提供一个简单好用的Web UI界面☆118Updated 7 months ago
- A Multi-Modal Dataset of Chinese Governmental Docunments☆31Updated 4 years ago
- ☆160Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆69Updated last year
- 实现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。☆70Updated last year
- 本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作☆59Updated 5 months ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆272Updated 6 months ago
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆46Updated 9 months ago
- this repo is mnbvc text quality classification using fastText☆16Updated last year
- A large-scale language model for scientific domain, trained on redpajama arXiv split☆131Updated last year
- 中文原生检索增强生成测评基准☆112Updated 11 months ago
- 中文世界的NLP自动标注开源工具,简单样本,交给LabelFast。☆65Updated 2 months ago
- MNBVC项目-ShareGPT语料清洗☆15Updated last year
- 中文大语言模型评测第二期☆70Updated last year
- Finetune Bloom big language model with Lora method☆31Updated last year
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆139Updated 11 months ago
- 利用LLM+敏感词库,来自动判别是否涉及敏感词。☆112Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆78Updated 8 months ago
- ☆64Updated last year
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆106Updated last year
- 专注于中文领域大语言模型,落地到某个行业某个领域,成为一个行业大模型、公司级别或行业级别领域大模型。☆116Updated 2 weeks ago
- "桃李“: 国际中文教育大模型☆175Updated last year
- 大语言模型指令调优工具(支持 FlashAttention)☆171Updated last year
- 大语言模型训练和服务调研☆37Updated last year