Uses a multi-threaded crawler to crawl idiom data
☆14Dec 11, 2020Updated 5 years ago
Alternatives and similar repositories for Crawl
Users that are interested in Crawl are comparing it to the libraries listed below.
- Python class for Elasticsearch, including add, batch add, update, delete, query, and scan query. Also with a demo that put Wikipedia in…☆17Sep 3, 2022Updated 3 years ago
- A BERT-based baseline for the Chinese idiom cloze test with PyTorch.☆18Dec 24, 2020Updated 5 years ago
- A wrapper class for the tf-idf model that computes tf-idf values for all documents and implements a tf-idf-based search engine. Given a query, it computes the similarity to each document and returns the top-k documents most similar to the query.☆16Nov 20, 2020Updated 5 years ago
- semantic similarity, word2vec + wmd, bert+wmd, pytorch☆31Jan 29, 2024Updated 2 years ago
- Datafountain-Epidemic government affairs quiz assistant competition. We divided this task into two parts: document retrieval and answer e…☆14Aug 21, 2022Updated 3 years ago
- Realtime Pose Estimation NCNN ONNX☆24Apr 29, 2020Updated 6 years ago
- Documentation notes☆15Mar 16, 2021Updated 5 years ago
- Implementation of AAAI2021 paper "Writing Polishment with Simile: Task, Dataset and A Neural Approach"☆21Dec 25, 2020Updated 5 years ago
- A Chinese simile recognition dataset of "Xiang".☆24Oct 5, 2022Updated 3 years ago
- Low-level implementation of BBPE☆38Apr 29, 2024Updated 2 years ago
- ChineseBert for Chinese spelling correction☆43Mar 14, 2023Updated 3 years ago
- Reference Implementation for WSDM 2018 Paper "Hyperbolic Representation Learning for Fast and Efficient Neural Question Answering"☆68Nov 16, 2018Updated 7 years ago
- [AAAI 2024] LLMEval Phase II dataset — professional domain evaluation across 12 academic disciplines☆71May 21, 2026Updated last week
- A capsule-based opinion-type reading comprehension model☆88Aug 8, 2019Updated 6 years ago
- SG-Net: Syntax-guided machine reading comprehension (AAAI 2020)☆83Dec 16, 2022Updated 3 years ago
- ☆98Dec 5, 2023Updated 2 years ago
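- Kesci (科赛网) LES Cup (莱斯杯): summary of slides, documents, and code from the top-ten teams of the 2nd national "Military Intelligent Machine Reading" challenge☆132Feb 5, 2020Updated 6 years ago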
- TensorFlow code and pre-trained models for BERT and ERNIE☆146Jun 5, 2019Updated 6 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- CAIL 2019 (法研杯) reading comprehension track, top 3☆151Nov 13, 2023Updated 2 years ago
- Dynamic Memory Networks (https://arxiv.org/abs/1603.01417) in Tensorflow☆238Aug 10, 2016Updated 9 years ago
- Keras example of seq2seq for automatic title generation☆331Dec 9, 2019Updated 6 years ago
- Chinese BERT with words as the basic unit☆476Nov 18, 2021Updated 4 years ago
- KgCLUE: large-scale open-source Chinese knowledge graph question answering☆457Jul 5, 2022Updated 3 years ago
- Reject complicated operations for incorporating lexicon for Chinese NER.☆437Jan 22, 2022Updated 4 years ago
- Materials for learning SGLang☆826Jan 5, 2026Updated 4 months ago
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization☆720Aug 13, 2024Updated last year
- An implementation of TransE and its extended models for Knowledge Representation Learning on TensorFlow☆513Nov 3, 2022Updated 3 years ago
- A full Python Implementation of the ROUGE Metric (not a wrapper)☆718Nov 19, 2024Updated last year
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is design…☆1,121Jul 30, 2021Updated 4 years ago
- Four word embedding models implemented in Python. Supporting arbitrary context features☆847Aug 22, 2019Updated 6 years ago
- An open-source educational chat model from ICALK, East China Normal University. An open-source Chinese-English educational dialogue LLM. (General-purpose base model, GPU deployment, data cleaning) Tribute to: LLaMA, MOSS, BELLE, Z…☆934Jul 18, 2025Updated 10 months ago
- A collection of NLP competition strategy implementations, baselines for various tasks, competition experience posts (current, past, and practice competitions), NLP conference dates, commonly followed media channels, GPU recommendations, and more; continuously updated☆2,242Aug 29, 2023Updated 2 years ago
- Must-read papers on Machine Reading Comprehension☆889Jul 9, 2020Updated 5 years ago
- A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)☆1,148Jan 4, 2024Updated 2 years ago
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding☆1,337Mar 6, 2025Updated last year
- 📖 Chinese edition of "Python Parallel Programming Cookbook"☆1,525Nov 11, 2025Updated 6 months ago
- Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granul…☆1,542May 31, 2023Updated 2 years ago
- Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]☆1,851Jul 27, 2025Updated 10 months ago