liwenju0 / cutword
一个简单快速的分词、命名实体识别工具
☆573Updated last week
Alternatives and similar repositories for cutword:
Users that are interested in cutword are comparing it to the libraries listed below
- unified embedding model☆852Updated last year
- 中文Mixtral-8x7B(Chinese-Mixtral-8x7B)☆648Updated 7 months ago
- Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.☆540Updated 8 months ago
- DomainWordsDict, Chinese words dict that contains more than 68 domains, which can be used as text classification、knowledge enhance task。…☆689Updated 3 years ago
- 基于开源embedding模型的中文向量效果测试☆136Updated last year
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆271Updated 6 months ago
- 聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)☆623Updated last year
- An easy-to-use framework for modular RAG☆344Updated this week
- 自然语言转SQL,直接连接数据库查询☆380Updated last year
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆298Updated 7 months ago
- A Python Package to Access World-Class Generative Models☆128Updated 9 months ago
- PromptCLUE, 全中文任务支持零样本学习模型☆662Updated last year
- Baichuan2代码的逐行解析版本,适合小白☆212Updated last year
- Q&A based on elasticsearch+langchain+chatglm2 | 基于elasticsearch,langchain,chatglm2的自有知识库问答☆239Updated last year
- ChatGLM-6B 指令学习|指令数据|Instruct☆655Updated last year
- Analysis of Chinese and English layouts 中英文版面分析☆187Updated last week
- 本地知识库 + chatGLM6B + CustomAgent☆267Updated last year
- 语言模型中文认知能力分析☆236Updated last year
- 活字通用大模型☆387Updated 6 months ago
- ChatPilot: Chat Agent Web UI,实现Chat对话前端,支持Google搜索、文件网址对话(RAG)、代码解释器功能,复现了Kimi Chat(文件,拖进来;网址,发出来)。☆553Updated 2 months ago
- ChatGLM2-6B 全参数微调,支持多轮对话的高效微调。☆398Updated last year
- 更纯粹、更高压缩率的Tokenizer☆473Updated 4 months ago
- ☆349Updated 8 months ago
- 中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)☆603Updated 11 months ago
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability☆420Updated last year
- Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。☆294Updated 11 months ago
- Alpaca Chinese Dataset -- 中文指令微调数据集☆193Updated 5 months ago
- ☆316Updated 9 months ago
- 企业级RAG系统从入门到精通☆400Updated 3 weeks ago
- pke_zh, python keyphrase extraction for chinese(zh). 中文关键词或关键句提取工具,实现了KeyBert、PositionRank、TopicRank、TextRank等算法,开箱即用。☆201Updated last year