Uses a multi-threaded crawler to collect idiom data
☆14Dec 11, 2020Updated 5 years ago
Alternatives and similar repositories for Crawl
Users that are interested in Crawl are comparing it to the libraries listed below.
- Python class for Elasticsearch, including add, batch add, update, delete, query, and scan query. Also with a demo that put Wikipedia in…☆17Sep 3, 2022Updated 3 years ago
- A BERT-based baseline for the Chinese idiom cloze test with PyTorch.☆18Dec 24, 2020Updated 5 years ago
- A wrapper class for the tf-idf model. It computes tf-idf values for all documents and implements a tf-idf-based search engine: given a query, it computes the similarity to each document and returns the top-k documents most similar to the query.☆16Nov 20, 2020Updated 5 years ago
- semantic similarity, word2vec + wmd, bert+wmd, pytorch☆31Jan 29, 2024Updated 2 years ago
- Datafountain-Epidemic government affairs quiz assistant competition. We divided this task into two parts: document retrieval and answer e…☆14Aug 21, 2022Updated 3 years ago
- Solution write-up for the DataFountain epidemic government affairs Q&A assistant competition☆16May 2, 2020Updated 5 years ago
- Documentation notes☆15Mar 16, 2021Updated 5 years ago
- Implementation of AAAI2021 paper "Writing Polishment with Simile: Task, Dataset and A Neural Approach"☆21Dec 25, 2020Updated 5 years ago
- ☆20Oct 15, 2022Updated 3 years ago
- A Chinese simile recognition dataset of "Xiang".☆24Oct 5, 2022Updated 3 years ago
- Moss Vortex is a lightweight and high-performance deployment and inference backend engineered specifically for MOSS 003, providing a weal…☆37Apr 25, 2023Updated 2 years ago
- A Specialist-annotated Dataset for Medical-domain Chinese Spelling Correction☆36Jun 6, 2022Updated 3 years ago
- Run chatglm3-6b on BM1684X☆39Mar 1, 2024Updated 2 years ago
- ChineseBert for Chinese spelling correction☆43Mar 14, 2023Updated 3 years ago
- A capsule-based model for opinion-type reading comprehension☆88Aug 8, 2019Updated 6 years ago
- A pytorch implementation of Capsule Network.☆100Jul 25, 2024Updated last year
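- Kesci LES Cup: summary of slides, documents, and code from the top ten teams in the 2nd national "Military Intelligent Machine Reading" challenge☆132Feb 5, 2020Updated 6 years ago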
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- Top-3 solution for the reading comprehension track of CAIL 2019 (法研杯)☆151Nov 13, 2023Updated 2 years ago
- Dynamic Memory Networks (https://arxiv.org/abs/1603.01417) in Tensorflow☆239Aug 10, 2016Updated 9 years ago
- ☆343Dec 11, 2018Updated 7 years ago
- A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".☆305Aug 24, 2022Updated 3 years ago
- keras example of seq2seq, auto title☆331Dec 9, 2019Updated 6 years ago
- ☆368Jul 19, 2023Updated 2 years ago
- This is updated version of the dataset for Chinese community medical question answering.☆376Jan 9, 2019Updated 7 years ago
- KgCLUE: large-scale open-source Chinese knowledge graph question answering☆455Jul 5, 2022Updated 3 years ago
- Reject complicated operations for incorporating lexicon for Chinese NER.☆437Jan 22, 2022Updated 4 years ago
- Materials for learning SGLang☆785Jan 5, 2026Updated 2 months ago
- Open-source release of the MuCGEC Chinese error correction dataset and SOTA text correction models; Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…☆565Jun 9, 2023Updated 2 years ago
- An implementation of TransE and its extended models for Knowledge Representation Learning on TensorFlow☆513Nov 3, 2022Updated 3 years ago
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆643Apr 9, 2024Updated last year
- 📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)☆758Dec 21, 2024Updated last year
- Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".☆877Aug 20, 2024Updated last year
- FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.☆1,041Sep 4, 2024Updated last year
- Four word embedding models implemented in Python. Supporting arbitrary context features☆848Aug 22, 2019Updated 6 years ago
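- An open-source educational chat model from ICALK, East China Normal University. An open-source Chinese-English educational dialogue LLM (general-purpose base model, GPU deployment, data cleaning). Tribute to: LLaMA, MOSS, BELLE, Z…☆920Jul 18, 2025Updated 8 months ago
- A collection of NLP competition strategy implementations, baselines for various tasks, competition experience posts (current, past, and practice competitions), NLP conference dates, popular media channels, GPU recommendations, and more; continuously updated☆2,238Aug 29, 2023Updated 2 years ago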
- Task generation for testing text understanding and reasoning☆907Mar 27, 2019Updated 7 years ago
- LongBench v2 and LongBench (ACL 25'&24')☆1,122Jan 15, 2025Updated last year