liwenju0 / cutword
一个简单快速的分词、命名实体识别工具
☆538Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for cutword
- unified embedding model☆832Updated last year
- An easy-to-use framework for modular RAG☆295Updated this week
- A Python Package to Access World-Class Generative Models☆125Updated 5 months ago
- Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.☆489Updated 4 months ago
- 中文Mixtral-8x7B(Chinese-Mixtral-8x7B)☆641Updated 3 months ago
- 聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)☆590Updated last year
- Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型☆399Updated last year
- 夫子•明察司法大模型是由山东大学、浪潮云、中国政法大学联合研发,以 ChatGLM 为大模型底座,基于海量中文无监督司法语料与有监督司法微调数据训练的中文司法大模型。该模型支持法条检索、案例分析、三段论推理判决以及司法对话等功能,旨在为用户提供全方位、高精准的法律咨询与解答…☆286Updated 3 weeks ago
- ChatPilot: Chat Agent Web UI,实现Chat对话前端,支持Google搜索、文件网址对话(RAG)、代码解释器功能,复现了Kimi Chat(文件,拖进来;网址,发出来)。☆513Updated last week
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆269Updated 3 months ago
- [中文法律大模型] DISC-LawLLM: an intelligent legal system powered by large language models (LLMs) to provide a wide range of legal services.☆557Updated last month
- 中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)☆586Updated 6 months ago
- 📝 An Awesome Collection of Chinese Legal Dataset and Relevant Resources. 致力于收集全面的中文法律数据源☆775Updated last year
- 语言模型中文认知能力分析☆235Updated last year
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability☆421Updated last year
- Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。☆290Updated 7 months ago
- 通义千问VLLM推理 部署DEMO☆446Updated 7 months ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆242Updated 2 months ago
- 基于开源embedding模型的中文向量效果测试☆125Updated last year
- 更纯粹、更高压缩率的Tokenizer☆454Updated 7 months ago
- Baichuan2代码的逐行解析版本,适合小白☆208Updated last year
- GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.☆557Updated 7 months ago
- 轩辕:度小满中文金融对话大模型☆1,071Updated last month
- PDF解析(文字,章节,表格,图片,参考),基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答,摘要,信息抽取☆157Updated last year
- 大语言模型微调,Qwen2VL、Qwen2、GLM4指令微调☆212Updated this week
- 🚀WebUI integrated platform for latest LLMs | 各大语言模型的全流程工具 WebUI 整合包。支持主流大模型API接口和开源模型。支持知识库,数据库,角色扮演,mj文生图,LoRA和全参数微调,数据集制作,live2d等全流程应用…☆483Updated this week
- Alpaca Chinese Dataset -- 中文指令微调数据集【人工+GPT4o持续更新】☆185Updated last month
- 中文法律LLaMA (LLaMA for Chinese legel domain)☆856Updated 2 months ago
- Q&A based on elasticsearch+langchain+chatglm2 | 基于elasticsearch,langchain,chatglm2的自有知识库问答☆231Updated last year
- Luotuo Embedding(骆驼嵌入) is a text embedding model, which developed by 李鲁鲁, 冷子昂, 陈启源, 蒟蒻等.☆259Updated last year