liwenju0 / cutwordLinks
一个简单快速的分词、命名实体识别工具
☆602Updated 3 months ago
Alternatives and similar repositories for cutword
Users that are interested in cutword are comparing it to the libraries listed below
Sorting:
- 中文拼写错误和语法错误纠正☆315Updated 2 weeks ago
- unified embedding model☆863Updated last year
- 中文Mixtral-8x7B(Chinese-Mixtral-8x7B)☆650Updated 10 months ago
- 中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)☆601Updated last year
- ChatPilot: Chat Agent Web UI,实现Chat对话前端,支持Google搜索、文件网址对话(RAG)、代码解释器功能,复现了Kimi Chat(文件,拖进来;网址,发出来)。☆572Updated 3 weeks ago
- 夫子•明察司法大模型是由山东大学、浪潮云、中国政法大学联合研发,以 ChatGLM 为大模型底座,基于海量中文无监督司法语料与有监督司法微调数据训练的中文司法大模型。该模型支持法条检索、案例分析、三段论推理判决以及司法对话等功能,旨在为用户提供全方位、高精准的法律咨询与解答…☆346Updated 8 months ago
- 自然语言转SQL,直接连接数据库查询☆387Updated 2 years ago
- Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.☆555Updated last year
- An easy-to-use framework for modular RAG☆375Updated this week
- 中文法律LLaMA (LLaMA for Chinese legel domain)☆951Updated 10 months ago
- An open-source educational chat model from ICALK, East China Normal University. 开源中英教育对话大模型。(通用基座模型,GPU部署,数据清理) 致敬: LLaMA, MOSS, BELLE, Z…☆803Updated last week
- Similarities: a toolkit for similarity calculation and semantic search. 相似度计算、匹配搜索工具包,支持亿级数据文搜文、文搜图、图搜图,python3开发,开箱即用。☆857Updated 8 months ago
- Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。☆295Updated last year
- 基于开源embedding模型的中文向量效果测试☆143Updated 2 years ago
- Q&A based on elasticsearch+langchain+chatglm2 | 基于elasticsearch,langchain,chatglm2的自有知识库问答☆242Updated last year
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆306Updated 11 months ago
- Z-Bench 1.0 by 真格基金:一个麻瓜的大语言模型中文测试集。Z-Bench is a LLM prompt dataset for non-technical users, developed by an enthusiastic AI-focused team…☆496Updated 2 years ago
- A Python Package to Access World-Class Generative Models☆127Updated last year
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆291Updated 10 months ago
- DomainWordsDict, Chinese words dict that contains more than 68 domains, which can be used as text classification、knowledge enhance task。…☆708Updated 3 years ago
- PromptCLUE, 全中文任务支持零样本学习模型☆664Updated 2 years ago
- 一个适合学习、使用、自主扩展的RAG【检索增强生成】系统!可联网做AI搜索☆496Updated 10 months ago
- Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型☆409Updated last year
- Luotuo Embedding(骆驼嵌入) is a text embedding model, which developed by 李鲁鲁, 冷子昂, 陈启源, 蒟蒻等.☆266Updated last year
- 🚀WebUI integrated platform for latest LLMs | 各大语言模型的全流程工具 WebUI 整合包。支持主流大模型API接口和开源模型。支持知识库,数据库,角色扮演,mj文生图,LoRA和全参数微调,数据集制作,live2d等全流程应用…☆540Updated 7 months ago
- Analysis of Chinese and English layouts 中英文版面分析☆222Updated last week
- 聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)☆637Updated 2 years ago
- Repo for adapting Meta LlaMA2 in Chinese! META最新发布的LlaMA2的汉化版! (完全开源可商用)☆743Updated last year
- Alpaca Chinese Dataset -- 中文指令微调数据集☆208Updated 9 months ago
- 利用LLM+敏感词库,来自动判别是否涉及敏感词。☆126Updated 2 years ago