Tanh-wink / Crawl
Uses a multi-threaded crawler to collect idiom data
☆14 Updated 5 years ago
Alternatives and similar repositories for Crawl
Users that are interested in Crawl are comparing it to the libraries listed below
- A Python class for Elasticsearch, including add, batch add, update, delete, query, and scan query. Also includes a demo that puts Wikipedia in… ☆17 Updated 3 years ago
- A wrapper class for a tf-idf model that computes the tf-idf values of all documents and implements a tf-idf-based search engine: given a query, it computes the similarity to each document and returns the top-k documents most similar to the query ☆16 Updated 5 years ago
- A BERT-based baseline for the Chinese idiom cloze test, in PyTorch. ☆18 Updated 5 years ago
- Semantic similarity: word2vec + WMD, BERT + WMD, PyTorch ☆31 Updated last year
- ☆272 Updated last year
- ☆14 Updated 4 years ago
- Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021 ☆240 Updated 3 years ago
- ☆420 Updated last year
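- Large Scale Chinese Corpus for NLP ☆24 Updated 6 years ago
- Hugging BERT together. Misc scripts for Huggingface transformers. ☆73 Updated 2 years ago
- A reproduction of Faspell, with reflections on it ☆23 Updated 2 years ago
- ☆130 Updated 3 years ago
- Chinese language model pre-training in PyTorch ☆386 Updated 5 years ago
- A collection of the extractive MRC datasets available so far for Chinese ☆122 Updated last year
- Global Pointer: unified handling of nested and non-nested NER ☆259 Updated 4 years ago
- SIGHAN Chinese error correction datasets and their converted formats ☆71 Updated 5 years ago
- Coreference resolution based on PyTorch + BERT ☆14 Updated 4 years ago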
- Use deep models including BiLSTM, ABCNN, ESIM, RE2, BERT, etc. and evaluate on 5 Chinese NLP datasets: LCQMC, BQ Corpus, ChineseSTS, OCN… ☆78 Updated 3 years ago
- Code for the ACL2021 paper "Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction" ☆99 Updated 4 years ago
- A Multi-modal Model Chinese Spell Checker Released on ACL2021. ☆159 Updated 2 years ago
- Reproduction of the ACL 2019 paper: Improving Multi-turn Dialogue Modelling with Utterance ReWriter ☆137 Updated 5 years ago
- This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check" ☆294 Updated 6 years ago
- A Specialist-annotated Dataset for Medical-domain Chinese Spelling Correction ☆34 Updated 3 years ago
- Pointer-generator transformer model and transformer model for the morphological inflection task. custom to the SIGMORPHON 2019 shared tas… ☆27 Updated 5 years ago
- ☆49 Updated 2 years ago
- CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation ☆494 Updated 3 years ago
- ☆278 Updated 3 years ago
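- End-to-end long-text summarization model (CAIL 2020 judicial summarization track) ☆398 Updated last year
- An upgraded version of SimBERT (SimBERTv2)! ☆445 Updated 3 years ago
- A Transformer-based pointer-generator network ☆93 Updated 5 years ago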