hiyoung123 / DuplicateRemoveLinks
基于simhash的文本去重算法
☆20Updated 4 years ago
Alternatives and similar repositories for DuplicateRemove
Users that are interested in DuplicateRemove are comparing it to the libraries listed below
Sorting:
- 文本智能校对大赛(Chinese Text Correction)的baseline☆67Updated 3 years ago
- 基于Pytorch实现的中文文本分类脚手架,以及常用模型对比。☆18Updated 4 years ago
- 基于seq2edit (Gector) 的中文文本纠错。☆29Updated 3 years ago
- SinglepassTextCluster, an TextCluster tools based on Singlepass cluster algorithm that use tfidf vector and doc2vec,which can be used for…☆65Updated 4 years ago
- pytorch版simcse无监督语义相似模型☆23Updated 4 years ago
- RelExt: A Tool for Relation Extraction from Text. 文本实体关系抽取工具。☆51Updated 3 years ago
- pytorch版基于gpt+nezha的中文多轮Cdial☆12Updated 3 years ago
- 中文bigbird预训练模型☆96Updated 3 years ago
- CTC2021-中文文本纠错大赛的SOTA方案及在线演示☆73Updated 2 years ago
- 对话改写介绍文章☆98Updated 2 years ago
- Chinese Machine Reading 2021海华AI挑战赛·中文阅读理解·技术组·第三名☆20Updated 4 years ago
- ☆57Updated 3 years ago
- NLU & NLG (zero-shot) depend on mengzi-t5-base-mt pretrained model☆76Updated 3 years ago
- 基于PaddleNLP开源的抽取式UIE进行医学命名实体识别(torch实现)☆44Updated 3 years ago
- CLUEWSC2020: WSC Winograd模式挑战中文版,中文指代消解任务☆79Updated 5 years ago
- using lear to do ner extraction☆29Updated 3 years ago
- 句子匹配模型,包括无监督的SimCSE、ESimCSE、PromptBERT,和有监督的SBERT、CoSENT。☆99Updated 3 years ago
- 基于 Tensorflow,仿 Scikit-Learn 设计的深度学习自然语言处理框架。支持 40 余种模型类,涵盖语言模型、文本分类、NER、MRC、知识蒸馏等各个领域☆117Updated 2 years ago
- 中文版unilm预训练模型☆82Updated 4 years ago
- ☆11Updated 3 years ago
- bert_avg,bert_whitening,sbert,consert,simcse,esimcse 中文句向量表示☆16Updated 3 years ago
- sodic2021 法律咨询智能问答 Baseline 线上35+☆17Updated 4 years ago
- 基于“Seq2Seq+前缀树”的知识图谱问答☆70Updated 4 years ago
- An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)☆109Updated 2 years ago
- 时间抽取、解析、标准化工具☆56Updated 3 years ago
- This is the official code for paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models".☆68Updated 4 years ago
- CCL2022 新闻脉络关系识别☆31Updated 3 years ago
- NLP实验:新词挖掘+预训练模型继续Pre-training☆47Updated 2 years ago
- R-Drop方法在中文任务上的简单实验☆91Updated 3 years ago
- P-tuning方法在中文上的简单实验☆140Updated 4 years ago