Use multi-threaded crawler to crawl the idiom data
☆14Dec 11, 2020Updated 5 years ago
Alternatives and similar repositories for Crawl
Users that are interested in Crawl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- python class for elasticsearch , including add, batch add, update, delete, query, and scan query. also with a demo that put Wikipedia in…☆17Sep 3, 2022Updated 3 years ago
- A based-bert baseline for Chinese idiom cloze test with pytorch.☆18Dec 24, 2020Updated 5 years ago
- tf-idf 模型封装类,包含计算所有文档的tf-idf值,实现了基于tf-idf搜索引擎功能。根据query,计算与每个文档的相似度,返回与query相似度最高的topk文档☆15Nov 20, 2020Updated 5 years ago
- semantic similarity, word2vec + wmd, bert+wmd, pytorch☆31Jan 29, 2024Updated 2 years ago
- Datafountain-Epidemic government affairs quiz assistant competition. We divided this task into two parts: document retrieval and answer e…☆14Aug 21, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- DataFountain 疫情政务问答助手解决方案分享☆16May 2, 2020Updated 6 years ago
- 文档记录☆15Mar 16, 2021Updated 5 years ago
- ☆21Oct 15, 2022Updated 3 years ago
- BBPE 底层实现☆38Apr 29, 2024Updated 2 years ago
- run chatglm3-6b in BM1684X☆39Mar 1, 2024Updated 2 years ago
- ChineseBert用于中文拼写纠错☆43Mar 14, 2023Updated 3 years ago
- Reference Implementation for WSDM 2018 Paper "Hyperbolic Representation Learning for Fast and Efficient Neural Question Answering"☆68Nov 16, 2018Updated 7 years ago
- [AAAI 2024] LLMEval Phase II dataset — professional domain evaluation across 12 academic disciplines☆71May 21, 2026Updated 3 weeks ago
- 基于capsule的观点型阅读理解模型☆88Aug 8, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- SG-Net: Syntax-guided machine reading comprehension (AAAI 2020)☆83Dec 16, 2022Updated 3 years ago
- 科赛网-莱斯杯:全国第二届“军事智能机器阅读”挑战赛 前十团队PPT文档代码总结☆132Feb 5, 2020Updated 6 years ago
- ChID: A Large-scale Chinese IDiom Dataset for Cloze Test☆150May 8, 2023Updated 3 years ago
- Neural word segmentation with rich pretraining, code for ACL 2017 paper☆165Jan 10, 2019Updated 7 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated 2 years ago
- 法研杯2019 阅读理解赛道 top3☆151Nov 13, 2023Updated 2 years ago
- Dynamic Memory Networks (https://arxiv.org/abs/1603.01417) in Tensorflow☆238Aug 10, 2016Updated 9 years ago
- ☆344Dec 11, 2018Updated 7 years ago
- Naive Bayes-based Context Extension☆328Dec 9, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- keras example of seq2seq, auto title☆331Dec 9, 2019Updated 6 years ago
- Tools for extracting tables and results from Machine Learning papers☆440Nov 28, 2022Updated 3 years ago
- Python wrapper for Stanford CoreNLP.☆917Dec 7, 2021Updated 4 years ago
- ☆368Jul 19, 2023Updated 2 years ago
- 以词为基本单位的中文BERT☆476Nov 18, 2021Updated 4 years ago
- C++ implementation of Qwen-LM☆627Dec 6, 2024Updated last year
- A prize for finding tasks that cause large language models to show inverse scaling☆621Oct 11, 2023Updated 2 years ago
- MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…☆568Jun 9, 2023Updated 3 years ago
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization☆722Aug 13, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆642Apr 9, 2024Updated 2 years ago
- 📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)☆762Apr 23, 2026Updated last month
- Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".☆885Aug 20, 2024Updated last year
- 🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!☆888Sep 3, 2023Updated 2 years ago
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is design…☆1,122Jul 30, 2021Updated 4 years ago
- Four word embedding models implemented in Python. Supporting arbitrary context features☆846Aug 22, 2019Updated 6 years ago
- An open-source educational chat model from ICALK, East China Normal University. 开源中英教育对话大模型。(通用基座模型,GPU部署,数据清理) 致敬: LLaMA, MOSS, BELLE, Z…☆941Jul 18, 2025Updated 11 months ago