Use multi-threaded crawler to crawl the idiom data
☆14Dec 11, 2020Updated 5 years ago
Alternatives and similar repositories for Crawl
Users that are interested in Crawl are comparing it to the libraries listed below
Sorting:
- python class for elasticsearch , including add, batch add, update, delete, query, and scan query. also with a demo that put Wikipedia in…☆17Sep 3, 2022Updated 3 years ago
- A based-bert baseline for Chinese idiom cloze test with pytorch.☆18Dec 24, 2020Updated 5 years ago
- tf-idf 模型封装类,包含计算所有文档的tf-idf值,实现了基于tf-idf搜索引擎功能。根据query,计算与每个文档的相似度,返回与query相似度最高的topk文档☆16Nov 20, 2020Updated 5 years ago
- semantic similarity, word2vec + wmd, bert+wmd, pytorch☆31Jan 29, 2024Updated 2 years ago
- Datafountain-Epidemic government affairs quiz assistant competition. We divided this task into two parts: document retrieval and answer e…☆14Aug 21, 2022Updated 3 years ago
- DataFountain 疫情政务问答助手解决方案分享☆16May 2, 2020Updated 5 years ago
- 文档记录☆15Mar 16, 2021Updated 4 years ago
- ☆20Oct 15, 2022Updated 3 years ago
- Implementation of AAAI2021 paper "Writing Polishment with Simile: Task, Dataset and A Neural Approach"☆21Dec 25, 2020Updated 5 years ago
- Moss Vortex is a lightweight and high-performance deployment and inference backend engineered specifically for MOSS 003, providing a weal…☆37Apr 25, 2023Updated 2 years ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆89Mar 24, 2024Updated last year
- 基于capsule的观点型阅读理解模型☆88Aug 8, 2019Updated 6 years ago
- ☆99Dec 5, 2023Updated 2 years ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆104Jul 20, 2023Updated 2 years ago
- This is the dataset for Chinese community medical question answering.☆111Oct 22, 2019Updated 6 years ago
- 科赛网-莱斯杯:全国第二届“军事智能机器阅读”挑战赛 前十团队PPT文档代码总结☆132Feb 5, 2020Updated 6 years ago
- TensorFlow code and pre-trained models for BERT and ERNIE☆146Jun 5, 2019Updated 6 years ago
- ChID: A Large-scale Chinese IDiom Dataset for Cloze Test☆150May 8, 2023Updated 2 years ago
- 法研杯2019 阅读理解赛道 top3☆151Nov 13, 2023Updated 2 years ago
- Dynamic Memory Networks (https://arxiv.org/abs/1603.01417) in Tensorflow☆239Aug 10, 2016Updated 9 years ago
- ☆343Dec 11, 2018Updated 7 years ago
- ☆367Jul 19, 2023Updated 2 years ago
- Reject complicated operations for incorporating lexicon for Chinese NER.☆437Jan 22, 2022Updated 4 years ago
- KgCLUE: 大规模中文开源知识图谱问答☆454Jul 5, 2022Updated 3 years ago
- 以词为基本单位的中文BERT☆476Nov 18, 2021Updated 4 years ago
- C++ implementation of Qwen-LM☆617Dec 6, 2024Updated last year
- ☆672Nov 1, 2024Updated last year
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization☆712Aug 13, 2024Updated last year
- 📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)☆756Dec 21, 2024Updated last year
- Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".☆872Aug 20, 2024Updated last year
- An open-source educational chat model from ICALK, East China Normal University. 开源中英教育对话大模型。(通用基座模型,GPU部署,数据清理) 致敬: LLaMA, MOSS, BELLE, Z…☆912Jul 18, 2025Updated 7 months ago
- Four word embedding models implemented in Python. Supporting arbitrary context features☆850Aug 22, 2019Updated 6 years ago
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is design…☆1,094Jul 30, 2021Updated 4 years ago
- FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.☆1,025Sep 4, 2024Updated last year
- Must-read papers on Machine Reading Comprehension☆891Jul 9, 2020Updated 5 years ago
- Task generation for testing text understanding and reasoning☆906Mar 27, 2019Updated 6 years ago
- LongBench v2 and LongBench (ACL 25'&24')☆1,101Jan 15, 2025Updated last year
- A Tensorflow implementation of QANet for machine reading comprehension☆983May 30, 2018Updated 7 years ago
- Image Test Time Augmentation with PyTorch!☆1,028Jul 28, 2023Updated 2 years ago