Htring / BM25Links
基于python的BM25文本匹配算法实现
☆33Updated 3 years ago
Alternatives and similar repositories for BM25
Users that are interested in BM25 are comparing it to the libraries listed below
Sorting:
- ☆57Updated 2 years ago
- BLOOM 模型的指令微调☆24Updated 2 years ago
- 记录NLP、CV、搜索、推荐等AI岗位最新情况。☆29Updated 2 years ago
- llama,chatglm 等模型的微调☆91Updated last year
- 通用简单工具项目☆20Updated last year
- 格物-多语言和中文大规模预训练模型-轻量版,涵盖纯中文、知识增强、113个语种多语言,采用主流Roberta架构,适用于NLU和NLG任务, 支持pytorch、tensorflow、uer、huggingface等框架。 Multilingual and Chinese …☆30Updated 2 years ago
- 大语言模型指令调优工具(支持 FlashAttention)☆178Updated last year
- ☆98Updated last year
- 文本智能校对大赛(Chinese Text Correction)的baseline☆68Updated 3 years ago
- moss chat finetuning☆51Updated last year
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆110Updated 2 years ago
- deepspeed+trainer简单高效实现多卡微调大模型☆129Updated 2 years ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆161Updated 3 months ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Updated last year
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Updated 2 years ago
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆55Updated 2 years ago
- GoGPT:基于Llama/Llama 2训练的中英文增强大模型|Chinese-Llama2☆79Updated 2 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Updated 2 years ago
- 中文机器阅读理解数据集☆107Updated 4 years ago
- 使用qlora 对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆90Updated 2 years ago
- 怎么训练一个LLM分词器☆153Updated 2 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆119Updated 10 months ago
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆47Updated 2 weeks ago
- 句子匹配模型,包括无监督的SimCSE、ESimCSE、PromptBERT,和有监督的SBERT、CoSENT。☆100Updated 2 years ago
- llama信息抽取实战☆100Updated 2 years ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆89Updated last year
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆102Updated 2 years ago
- LAiW: A Chinese Legal Large Language Models Benchmark☆84Updated last year
- [TALLIP] General and Domain Adaptive Chinese Spelling Check with Error Consistent Pretraining☆59Updated last year
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.☆65Updated 2 years ago