zejunwang1 / fastMatchLinks
Large-scale exact string matching tool
☆17Updated 11 months ago
Alternatives and similar repositories for fastMatch
Users that are interested in fastMatch are comparing it to the libraries listed below
Sorting:
- share data, prompt data , pretraining data☆36Updated 2 years ago
- aigc evals☆10Updated 2 years ago
- Unsupervised tableQA and databaseQA on chinese finance question and tabular data☆13Updated 2 years ago
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Updated 2 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Updated 2 years ago
- Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务,支持GPU多卡、多worker、多客户端调用,开箱即用。☆12Updated 3 years ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- 百度QA100万数据集☆46Updated 2 years ago
- This repository provides an implementation of "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction B…☆86Updated 7 months ago
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆50Updated this week
- 一个非常高效的字符串匹配工具,支持正向/反向最大匹配分词和多模式字符串精确匹配☆16Updated 2 years ago
- 大语言模型训练和服务调研☆37Updated 2 years ago
- moss chat finetuning☆51Updated last year
- YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual fin…☆38Updated 6 months ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆43Updated last year
- Evaluation for AI apps and agent☆44Updated 2 years ago
- rasa_chinese 的服务 package☆18Updated 4 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Updated 2 years ago
- 大规模中文语料☆44Updated 6 years ago
- GoGPT中文指令数据集构造☆10Updated 2 years ago
- A more efficient GLM implementation!☆54Updated 2 years ago
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.☆53Updated last year
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆48Updated last year
- Source code and checkpoints for legal pre-trained language models.☆15Updated 4 years ago
- GTS Engine: A powerful NLU Training System。GTS引擎(GTS-Engine)是一款开箱即用且性能强大的自然语言理解引擎,聚焦于小样本任务,能够仅用小样本就能自动化生产NLP模型。☆93Updated 2 years ago
- 有一个通用实体关系事件抽取的任务,需要使用到UIE模框架,而且需要将起部署到昇腾310服务器上,因为UIE模型底层使用的是ernie3.0,但是目前paddle官方还不支持ernie3.0模型在昇腾310上部署,所以才有了以下的操作,主要过程是,先试用paddle训练处模型…☆20Updated 3 years ago
- Another ChatGLM2 implementation for GPTQ quantization☆54Updated 2 years ago
- 中文预训练ModernBert☆98Updated 9 months ago
- MOSS 003 WebSearchTool: A simple but reliable implementation☆45Updated 2 years ago
- DST(Dialogue State Tracker) for LLM(Large Language Model)☆25Updated 2 years ago