hiyoung123 / DuplicateRemove
A simhash-based text deduplication algorithm
☆20 · Updated 3 years ago
Alternatives and similar repositories for DuplicateRemove:
Users who are interested in DuplicateRemove are comparing it to the libraries listed below.
- Long-text similarity model ☆18 · Updated last year
- Chinese text correction based on seq2edit (Gector). ☆27 · Updated 2 years ago
- RelExt: A Tool for Relation Extraction from Text (entity relation extraction). ☆48 · Updated 2 years ago
- SinglepassTextCluster, a text clustering tool based on the single-pass clustering algorithm that uses TF-IDF vectors and doc2vec, which can be used for… ☆62 · Updated 3 years ago
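- CTC2021: SOTA solution and online demo for the Chinese text correction competition ☆72 · Updated last year
- Regex-based extraction and normalization of time keywords ☆21 · Updated 3 years ago
- Chinese text correction model, implemented in Keras ☆70 · Updated 3 years ago
- A retrieval-based dialogue system solution built on vector recall: dense retrieval, FAQ…… ☆33 · Updated 3 years ago
- A PyTorch-based scaffold for Chinese text classification, with a comparison of common models. ☆18 · Updated 3 years ago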
- Benchmark of KgCLUE, with different models and methods ☆27 · Updated 3 years ago
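- Zero-shot learning evaluation benchmark, Chinese version ☆54 · Updated 3 years ago
- NLP experiments: new word discovery + continued pre-training of pretrained models ☆47 · Updated last year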
- Using lear to do NER extraction ☆29 · Updated 2 years ago
- (NBCE) Naive Bayes-based Context Extension on ChatGLM-6b ☆14 · Updated last year
- ☆21 · Updated 4 years ago
- This is the official code for paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models". ☆67 · Updated 3 years ago
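- Baseline for the Intelligent Text Proofreading competition (Chinese Text Correction) ☆67 · Updated 2 years ago
- An Open-Source Package for Chinese Open-domain Conversational Chatbot (a Chinese chit-chat dialogue system with one-click deployment of a WeChat chit-chat bot) ☆107 · Updated last year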
- ☆17 · Updated 3 years ago
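- Chinese sentence embedding representations: bert_avg, bert_whitening, sbert, consert, simcse, esimcse ☆16 · Updated 2 years ago
- This project uses the Chinese UniLM model trained by Yunwen Technology to automatically generate titles for a Weibo dataset. ☆37 · Updated 9 months ago
- Knowledge graph question answering based on "Seq2Seq + prefix tree" ☆71 · Updated 3 years ago
- Template-based text correction; Automatically Mining Error Templates for Grammatical Error Correction ☆37 · Updated 2 years ago
- Keyword extraction project ☆24 · Updated 4 years ago
- A simple, easy-to-use Python module for manipulating dates and times through strings. Regex-based time extraction, string time parsing, string time extraction. Chinese time extraction: extracting times from within a sentence ☆75 · Updated 6 months ago
- Chinese UniLM pretrained model ☆83 · Updated 3 years ago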
- ☆57 · Updated 2 years ago
- ☆127 · Updated 2 years ago
- ☆101 · Updated 4 years ago
- Chinese BigBird pretrained model ☆91 · Updated 2 years ago