wxfsd / nltk_dataLinks
Install using NLTK downloader: nltk.download()
☆17Updated 5 years ago
Alternatives and similar repositories for nltk_data
Users that are interested in nltk_data are comparing it to the libraries listed below
Sorting:
- Chinese-Text-Classification Project including bert-classification, textCNN and so on.☆161Updated 3 years ago
- SimCSE中文语义相似度对比学习模型☆90Updated 3 years ago
- NLP句子编码、句子embedding、语义相似度:BERT_avg、BERT_whitening 、SBERT、SmiCSE☆179Updated 3 years ago
- SimCSE在中文任务上的简单实验☆606Updated 2 years ago
- A simple framework for building some basic NLP tasks☆59Updated 3 years ago
- SimCSE有监督与无监督实验复现☆152Updated last year
- 中文数据集下SimCSE+ESimCSE的实现☆193Updated 3 years ago
- SimCSE在中文上的复现,有监督+无监督☆280Updated 9 months ago
- 基于Pytorch的文本分类框架,支持TextCNN、Bert、Electra等。☆64Updated 2 years ago
- 超轻量级bert的pytorch版本,大量中文注释,容易修改结构,持续更新☆417Updated 3 years ago
- 超长文本分类(大于1000字);文档级/篇章级文本分类;主要是解决长距离依赖问题☆132Updated 4 years ago
- 该仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记【信息抽取篇】☆28Updated 2 years ago
- This is the repo of the medical dialogue dataset 'imcs21' in CBLUE@Tianchi☆105Updated 2 years ago
- ☆279Updated 3 years ago
- experiments of some semantic matching models and comparison of experimental results.☆163Updated last month
- 基于实体首尾指针SPAN的序列标注框架☆28Updated 3 years ago
- CMeEE/CBLUE/NER实体识别☆132Updated 3 years ago
- “万创杯”中医药天池大数据竞赛——中医文献问题生成挑战 决赛 第一名方案☆138Updated 4 years ago
- 天池“公益AI之星”挑战赛-新冠疫情相似句对判定大赛☆16Updated 5 years ago
- 基于pytorch_bert的中文多标签分类☆92Updated 4 years ago
- Pattern-Exploiting Training在中文上的简单实验☆174Updated 5 years ago
- CBLUE-阿里天池中文医疗NLP打榜Baseline☆39Updated 3 years ago
- 苏神SPACE pytorch版本复现☆42Updated 4 years ago
- Archive for AINLP History Article☆198Updated 4 years ago
- 基于GOOGLE T5中文生成式模型的摘要生成/指代消解,支持batch批量生成,多进程☆227Updated 2 years ago
- 本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料,该资料目前包含 自然语言处理各领域的 面试题积累。☆107Updated 4 years ago
- Knowledge Graph☆176Updated 3 years ago
- 3000000+语义理解与匹配数据集。可用于无监督对比学习、半监督学习等构建中文领域效果最好的预训练模型☆311Updated 3 years ago
- Implemention of NER model on chinese dataset.☆74Updated 2 years ago
- 中文文本分类、序列标注工具包(pytorch),支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Chinese text classification and sequence labeling toolk…☆351Updated last year