zejunwang1 / fastMatch
Large-scale exact string matching tool
☆17Updated 2 months ago
Alternatives and similar repositories for fastMatch:
Users that are interested in fastMatch are comparing it to the libraries listed below
- Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务,支持GPU多 卡、多worker、多客户端调用,开箱即用。☆10Updated 2 years ago
- share data, prompt data , pretraining data☆36Updated last year
- 一个非常高效的字符串匹配工具,支持正向/反向最大匹配分词和多模式字符串精确匹配☆17Updated last year
- aigc evals☆10Updated last year
- rasa_chinese 的服务 package☆18Updated 3 years ago
- GoGPT中文指令数据集构造☆10Updated last year
- moss chat finetuning☆50Updated last year
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。☆28Updated last year
- A more efficient GLM implementation!☆55Updated 2 years ago
- Unsupervised tableQA and databaseQA on chinese finance question and tabular data☆12Updated 2 years ago
- 大规模中文语料☆41Updated 5 years ago
- 千问14B和7B的逐行解释☆58Updated last year
- 大语言模型训练和服务调研☆37Updated last year
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Updated last year
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆45Updated last year
- ☆23Updated last year
- 基于 onnxruntime 推理引擎的中文 ltp 词法分析☆13Updated 2 years ago
- KuaiSearch PERKS☆11Updated 3 years ago
- FinanceEventGraph,金融领域事件图谱开放数据集,可用于事件图谱搭建于实验 ,包括3865个acquire并购事件、9093个invest投资事件,总计12960的事件☆19Updated last year
- ROUGE for multilingual Summarization☆24Updated 3 years ago
- 该项目主要是抽取病历文件中的一些关键信息。并将抽取的内容进行streamlit前端的展示。目前支持的文件类型:图片,pdf文件,word文件☆23Updated 2 years ago
- use chatGLM to perform text embedding☆45Updated 2 years ago
- TensorRT☆11Updated 4 years ago
- ☆14Updated last year
- 使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆86Updated last year
- Fast pdf translate是一款pdf翻译软件,基于MinerU实现pdf转markdown的功能,接着对markdown进行分割, 送给大模型翻译,最后组装翻译结果并由pypandoc生成结果pdf。☆18Updated last month
- Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…☆39Updated last year
- GLM (General Language Model)☆24Updated 3 years ago