jayhenry / pdf2txt_mnbvcLinks
☆43Updated 2 years ago
Alternatives and similar repositories for pdf2txt_mnbvc
Users that are interested in pdf2txt_mnbvc are comparing it to the libraries listed below
Sorting:
- 国内首个全参数训练的法律大模型 HanFei-1.0 (韩非)☆126Updated 2 years ago
- 中文原生检索增强生成测评基准☆123Updated last year
- ChatGPT WebUI using gradio. 给 LLM 对话和检索知识问答RAG提供一个简单好用的Web UI界面☆138Updated last year
- 文本去重☆77Updated last year
- TianGong-AI-Unstructure☆69Updated 2 months ago
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆47Updated 11 months ago
- ☆67Updated last year
- 语言模型中文认知能力分析☆236Updated 2 years ago
- A large-scale language model for scientific domain, trained on redpajama arXiv split☆137Updated last year
- A Multi-Modal Dataset of Chinese Governmental Docunments☆39Updated 5 years ago
- Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…☆40Updated 2 years ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆141Updated last year
- 中文书籍收录整理, Collection of Chinese Books☆201Updated last year
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆305Updated last year
- Alpaca Chinese Dataset -- 中文指令微调数据集☆217Updated last year
- "桃李“: 国际中文教育大模型☆189Updated 2 years ago
- TechGPT 2.0: Technology-Oriented Generative Pretrained Transformer 2.0☆114Updated last year
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆314Updated last year
- 利用LLM+敏感词库,来自动判别是否涉及敏感词。☆136Updated 2 years ago
- MNBVC项目-ShareGPT语料清洗☆15Updated 2 years ago
- 本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作☆67Updated 2 months ago
- TechGPT: Technology-Oriented Generative Pretrained Transformer☆228Updated 2 years ago
- ☆44Updated 2 years ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆208Updated 11 months ago
- 骆驼QA,中文大语言阅读理解模型。☆75Updated 2 years ago
- Finetune Bloom big language model with Lora method☆32Updated 2 years ago
- Gaokao Benchmark for AI☆109Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆139Updated last year
- 演示 vllm 对中文大语言模型的神奇效果☆31Updated 2 years ago
- 用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.☆256Updated 2 years ago