zejunwang1 / darmatch
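A highly efficient string-matching tool that supports forward/reverse maximum-matching word segmentation and exact multi-pattern string matching
☆17 Updated last year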
Related projects
Alternatives and complementary repositories for darmatch
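- General-purpose layout analysis | Chinese document parsing | Document Layout Analysis | layout parser ☆45 Updated 5 months ago
- High-performance text tokenizer library ☆27 Updated 9 months ago
- Large-scale exact string matching tool ☆15 Updated last week
- Large-scale Chinese corpus ☆38 Updated 5 years ago
- Code implementation of Baichuan's Dynamic NTK-ALiBi: inference on longer texts without fine-tuning ☆46 Updated last year
- SOTA solution and online demo for CTC2021, the Chinese text correction competition ☆72 Updated last year
- A pure C++ cross-platform LLM acceleration library, callable from Python; supports baichuan, glm, llama, and moss base models; runs chatglm-6B-class models smoothly on mobile devices and can reach 10000+ tokens/s on a single card ☆45 Updated last year
- Time expression extraction, parsing, and normalization tool ☆49 Updated 2 years ago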
- Benchmark of KgCLUE, with different models and methods ☆26 Updated 2 years ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings) ☆76 Updated last year
- Chinese text correction ☆91 Updated 2 years ago
- LLM for NER ☆55 Updated 3 months ago
- A retrieval-based dialogue system solution built on vector recall: dense retrieval, FAQ… ☆32 Updated 3 years ago
- Template-based text correction; Automatically Mining Error Templates for Grammatical Error Correction ☆37 Updated 2 years ago
- Fine-tune the BLOOM large language model with LoRA ☆28 Updated last year
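- Zero-shot learning evaluation benchmark, Chinese version ☆54 Updated 3 years ago
- Chinese text correction model, implemented in Keras ☆70 Updated 3 years ago
- Chinese MobileBERT model ☆80 Updated 2 years ago
- Instruction fine-tuning of the BLOOM model ☆24 Updated last year
- Load CDial-GPT with bert4keras ☆38 Updated 4 years ago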
- ☆22 Updated 4 years ago
- A more efficient GLM implementation! ☆55 Updated last year
- Chinese text correction based on seq2edit (Gector). ☆26 Updated 2 years ago
- ☆57 Updated last year
- An introduction to using docker and docker compose. ☆20 Updated 2 months ago
- BERT-based Chinese text correction ☆226 Updated last year
- QBQTC: a large-scale search matching dataset ☆71 Updated 2 years ago