Tanh-wink/Crawl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Tanh-wink/Crawl)

Tanh-wink / Crawl

Use multi-threaded crawler to crawl the idiom data

☆14

Alternatives and similar repositories for Crawl

Users that are interested in Crawl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Tanh-wink / es_search
View on GitHub
python class for elasticsearch , including add, batch add, update, delete, query, and scan query. also with a demo that put Wikipedia in…
☆17Sep 3, 2022Updated 3 years ago
Tanh-wink / Chid_Bert_baseline
View on GitHub
A based-bert baseline for Chinese idiom cloze test with pytorch.
☆18Dec 24, 2020Updated 5 years ago
Tanh-wink / tf-idf
View on GitHub
tf-idf 模型封装类，包含计算所有文档的tf-idf值，实现了基于tf-idf搜索引擎功能。根据query，计算与每个文档的相似度，返回与query相似度最高的topk文档
☆15Nov 20, 2020Updated 5 years ago
Tanh-wink / semantic-similarity
View on GitHub
semantic similarity， word2vec + wmd， bert+wmd， pytorch
☆31Jan 29, 2024Updated 2 years ago
wenjunyoung / multipose
View on GitHub
☆12Mar 26, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Tanh-wink / NCPQA
View on GitHub
Datafountain-Epidemic government affairs quiz assistant competition. We divided this task into two parts: document retrieval and answer e…
☆14Aug 21, 2022Updated 3 years ago
wptoux / yqzwqa
View on GitHub
DataFountain 疫情政务问答助手解决方案分享
☆16May 2, 2020Updated 6 years ago
nilboy / reports
View on GitHub
文档记录
☆15Mar 16, 2021Updated 5 years ago
mrzjy / writing-polishment-with-simile
View on GitHub
Implementation of AAAI2021 paper "Writing Polishment with Simile: Task, Dataset and A Neural Approach"
☆21Dec 25, 2020Updated 5 years ago
utahnlp / DirectProbe
View on GitHub
☆21Oct 15, 2022Updated 3 years ago
cnunlp / Chinese-Simile-Recognition-Dataset
View on GitHub
A chinese simile recognition dataset of "Xiang".
☆24Oct 5, 2022Updated 3 years ago
OpenLMLab / MOSS_Vortex
View on GitHub
Moss Vortex is a lightweight and high-performance deployment and inference backend engineered specifically for MOSS 003, providing a weal…
☆37Apr 25, 2023Updated 3 years ago
OctopusMind / BBPE
View on GitHub
BBPE 底层实现
☆38Apr 29, 2024Updated 2 years ago
NVlabs / conv-tt-lstm
View on GitHub
☆129Nov 22, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
wangwang110 / CSC
View on GitHub
ChineseBert用于中文拼写纠错
☆43Mar 14, 2023Updated 3 years ago
vanzytay / WSDM2018_HyperQA
View on GitHub
Reference Implementation for WSDM 2018 Paper "Hyperbolic Representation Learning for Fast and Efficient Neural Question Answering"
☆68Nov 16, 2018Updated 7 years ago
Tanh-wink / Educational_system
View on GitHub
教务管理系统javaweb项目运行环境：window系统，Apache Tomcat v7.0.84、JDK1.8 开发环境：J2EE eclipse、navicat for mysql 运用的技术：MVC设计模式、DAO模式、Servlet、JSP、Filter、MyS…
☆136Jul 12, 2023Updated 3 years ago
icaffe / books
View on GitHub
☆166May 26, 2020Updated 6 years ago
Felixgithub2017 / MMCU
View on GitHub
MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING
☆90Mar 24, 2024Updated 2 years ago
freefuiiismyname / capsule-mrc
View on GitHub
基于capsule的观点型阅读理解模型
☆88Aug 8, 2019Updated 6 years ago
cooelf / SG-Net
View on GitHub
SG-Net: Syntax-guided machine reading comprehension (AAAI 2020)
☆83Dec 16, 2022Updated 3 years ago
tjunlp-lab / M3KE
View on GitHub
A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark
☆106Jul 20, 2023Updated 3 years ago
chujiezheng / ChID-Dataset
View on GitHub
ChID: A Large-scale Chinese IDiom Dataset for Cloze Test
☆150May 8, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
jiesutd / RichWordSegmentor
View on GitHub
Neural word segmentation with rich pretraining, code for ACL 2017 paper
☆165Jan 10, 2019Updated 7 years ago
wipen / bert_and_ernie
View on GitHub
TensorFlow code and pre-trained models for BERT and ERNIE
☆147Jun 5, 2019Updated 7 years ago
bitextor / bicleaner
View on GitHub
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
☆160Jun 18, 2024Updated 2 years ago
NoneWait / cail2019
View on GitHub
法研杯2019 阅读理解赛道 top3
☆151Nov 13, 2023Updated 2 years ago
google / active-qa
View on GitHub
☆344Dec 11, 2018Updated 7 years ago
bojone / NBCE
View on GitHub
Naive Bayes-based Context Extension
☆328Dec 9, 2024Updated last year
alibaba-edu / simple-effective-text-matching-pytorch
View on GitHub
A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".
☆305Aug 24, 2022Updated 3 years ago
bojone / seq2seq
View on GitHub
keras example of seq2seq, auto title
☆331Dec 9, 2019Updated 6 years ago
Lynten / stanford-corenlp
View on GitHub
Python wrapper for Stanford CoreNLP.
☆916Dec 7, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ZhuiyiTechnology / nl2sql_baseline
View on GitHub
☆368Jul 19, 2023Updated 3 years ago
HillZhang1999 / MuCGEC
View on GitHub
MuCGEC中文纠错数据集及文本纠错SOTA模型开源；Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…
☆570Jun 9, 2023Updated 3 years ago
xverse-ai / XVERSE-13B
View on GitHub
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
☆641Apr 9, 2024Updated 2 years ago
thunlp / TensorFlow-TransX
View on GitHub
An implementation of TransE and its extended models for Knowledge Representation Learning on TensorFlow
☆512Nov 3, 2022Updated 3 years ago
Ailln / cn2an
View on GitHub
📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）
☆765Apr 23, 2026Updated 3 months ago
pltrdy / rouge
View on GitHub
A full Python Implementation of the ROUGE Metric (not a wrapper)
☆720Nov 19, 2024Updated last year
IST-DASLab / sparsegpt
View on GitHub
Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
☆891Aug 20, 2024Updated last year