zejunwang1 / fastMatch
Large-scale exact string matching tool
☆15Updated last week
Related projects ⓘ
Alternatives and complementary repositories for fastMatch
- 大语言模型训练和服务调研☆34Updated last year
- Unsupervised tableQA and databaseQA on chinese finance question and tabular data☆12Updated last year
- 有一个通用实体关系事件抽取的任务,需要使用到UIE模框架,而且需要将起部署到昇腾310服务器上,因为UIE模型底层使用的是ernie3.0,但是目前paddle官方还不支持ernie3.0模型在昇腾310上部署,所以才有了以下的操作,主要过程是,先试用paddle训练处模型…☆17Updated 2 years ago
- 大规模中文语料☆38Updated 5 years ago
- 基于 onnxruntime 推理引擎的中文 ltp 词法分析☆13Updated 2 years ago
- 一个非常高效的字符串匹配工具,支持正向/反向最大匹配分词和多模式字符串精确匹配☆17Updated last year
- Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务,支持GPU多卡、多worker、多客户端调用,开箱即用。☆10Updated 2 years ago
- aigc evals☆10Updated 11 months ago
- 百度QA100万数据集☆49Updated 11 months ago
- rasa_chinese 的服务 package☆18Updated 3 years ago
- TensorRT☆11Updated 4 years ago
- KuaiSearch PERKS☆11Updated 3 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆46Updated last year
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆43Updated 7 months ago
- 高性能文本 Tokenizer 库☆27Updated 9 months ago
- Source code and checkpoints for legal pre-trained language models.☆15Updated 3 years ago
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆45Updated 5 months ago
- 用于生成文本纠错模型(如Gector)需要的大量数据。☆14Updated last year
- MOSS 003 WebSearchTool: A simple but reliable implementation☆45Updated last year
- AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。☆26Updated last year
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆9Updated 3 years ago
- share data, prompt data , pretraining data☆35Updated 11 months ago
- 该项目主要是抽取病历文件中的一些关键信息。并将抽取的内容进行streamlit前端的展示。目前支持的文件类型:图片,pdf文件,word文件☆22Updated 2 years ago
- Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…☆37Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 7 months ago
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆17Updated last year
- 别名发现系统☆11Updated 2 years ago
- A more efficient GLM implementation!☆55Updated last year