Tanh-wink / tf-idfLinks
tf-idf 模型封装类,包含计算所有文档的tf-idf值,实现了基于tf-idf搜索引擎功能。根据query,计算与每个文档的相似度,返回与query相似度最高的topk文档
☆16Updated 4 years ago
Alternatives and similar repositories for tf-idf
Users that are interested in tf-idf are comparing it to the libraries listed below
Sorting:
- Use multi-threaded crawler to crawl the idiom data☆14Updated 4 years ago
- python class for elasticsearch , including add, batch add, update, delete, query, and scan query. also with a demo that put Wikipedia in…☆17Updated 3 years ago
- A based-bert baseline for Chinese idiom cloze test with pytorch.☆18Updated 4 years ago
- semantic similarity, word2vec + wmd, bert+wmd, pytorch☆31Updated last year
- ☆33Updated 4 years ago
- 基于SpanBert的中文指代消解,pytorch实现☆101Updated 2 years ago
- Grammar correct project based Tencent's paper(Sequence to Action)☆15Updated 3 years ago
- 基于pytorch+bert的指代消解☆14Updated 4 years ago
- ☆279Updated 3 years ago
- 百度2021年语言与智能技术竞赛多形态信息抽取赛道事件抽取部分torch版baseline☆79Updated 4 years ago
- A Specialist-annotated Dataset for Medical-domain Chinese Spelling Correction☆30Updated 3 years ago
- 收集了目前为止中文领域的MRC抽取式数据集☆122Updated last year
- pytorch版unilm模型☆27Updated 4 years ago
- ☆129Updated 3 years ago
- CCL 2022 汉语学习者文本纠错评测☆142Updated 2 years ago
- 中文无监督SimCSE Pytorch实现☆135Updated 4 years ago
- 中文自然语言推理数据集(A large-scale Chinese Nature language inference and Semantic similarity calculation Dataset)☆433Updated 5 years ago
- 百度2021年语言与智能技术竞赛多形态信息抽取赛道关系抽取部分torch版baseline☆52Updated 4 years ago
- 基于prompt的中文文本分类。☆55Updated 2 years ago
- ☆136Updated 4 years ago
- ☆14Updated 4 years ago
- 中文机器阅读理解数据集☆107Updated 4 years ago
- 全局指针统一处理嵌套与非嵌套NER☆255Updated 4 years ago
- 论文复现《Named Entity Recognition as Dependency Parsing》☆131Updated 4 years ago
- 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP☆24Updated 6 years ago
- Implementation of AAAI2021 paper "Writing Polishment with Simile: Task, Dataset and A Neural Approach"☆21Updated 4 years ago
- 中文数据集下SimCSE+ESimCSE的实现☆193Updated 3 years ago
- 2021 语言与智能技术竞赛关系 篇章级关系抽取☆18Updated 4 years ago
- 句子匹配模型,包括无监督的SimCSE、ESimCSE、PromptBERT,和有监督的SBERT、CoSENT。☆100Updated 3 years ago
- ☆270Updated last year