shibing624 / pinyin-tokenizer
pinyintokenizer, 拼音分词器,将连续的拼音切分为单字拼音列表。
☆28Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for pinyin-tokenizer
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆42Updated this week
- 大语言模型训练和服务调研☆34Updated last year
- A Python Package to Access World-Class Generative Models☆125Updated 5 months ago
- 百度QA100万数据集☆49Updated 11 months ago
- GTS Engine: A powerful NLU Training System。GTS引擎(GTS-Engine)是一款开箱即用且性能强大的自然语言理解引擎,聚焦于小样本任务,能够仅用小样本就能自动化生产NLP模型。☆89Updated last year
- Large-scale exact string matching tool☆15Updated last week
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆23Updated 4 months ago
- 演示 vllm 对中文大语言模型的神奇效果☆31Updated last year
- 中文文本改写☆19Updated 4 years ago
- 百度百科 500 万数据集☆30Updated 11 months ago
- SmartSearch: Building a quick conversation-based search engine with LLMs.☆43Updated 6 months ago
- 大规模中文语料☆38Updated 5 years ago
- 本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作☆55Updated last month
- moss chat finetuning☆50Updated 6 months ago
- 骆驼QA,中文大语言阅读理解模型。☆72Updated last year
- Rasa通过PaddleNLP提供中文支持☆33Updated 2 years ago
- ☆92Updated 6 months ago
- Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…☆37Updated last year
- Agentica: Build Multi-Agent Workflow with 3 lines code. 三行代码打造个人助手智能体。☆88Updated last week
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆45Updated 5 months ago
- 该项目主要是抽取病历文件中的一些关键信息。并将抽取的内容进行streamlit前端的展示。目前支持的文件类型:图片,pdf文件,word文件☆22Updated 2 years ago
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.☆50Updated 7 months ago
- clue chatyuan finetuning☆16Updated 7 months ago
- Llama2开源模型中文版-全方位测评,基于SuperCLUE的OPEN基准 | Llama2 Chinese evaluation with SuperCLUE☆127Updated last year
- llama inference for tencentpretrain☆96Updated last year
- Evaluation for AI apps and agent☆35Updated 10 months ago
- chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu☆163Updated last year
- use chatGLM to perform text embedding☆45Updated last year