wjn1996/scrapy_for_zh_wiki

wjn1996 / scrapy_for_zh_wiki

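Crawls Chinese Wikipedia with a Scrapy-based hierarchical priority queue method and automatically extracts structured and semi-structured data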

☆157

Alternatives and similar repositories for scrapy_for_zh_wiki

Users that are interested in scrapy_for_zh_wiki are comparing it to the libraries listed below.

wjn1996 / KP-PLM
(Accepted by EMNLP 2022, main conference long paper) Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding
☆15 · Oct 29, 2022 · Updated 3 years ago
nchen909 / CodeAttention
CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure, EMNLP 2022
☆13 · Dec 10, 2022 · Updated 3 years ago
tata24 / google_HDImage_crawler
A Google high-definition image crawler
☆13 · Jan 7, 2020 · Updated 6 years ago
wjn1996 / Mathematical-Knowledge-Entity-Recognition
This is a novel project for mathematical knowledge entity recognition. The algorithm is mainly modeled by BiLSTM+CRF with Chinese Word Em…
☆48 · Dec 26, 2019 · Updated 6 years ago
NotCraft / ArxivDaily
ArxivDaily
☆13 · Updated this week
wjn1996 / Math-NRE-Mathematical-Relation-Extraction
Math-NRE: knowledge extraction for secondary school mathematics (relation extraction)
☆14 · Mar 20, 2020 · Updated 6 years ago
bigai-nlco / DocGNRE
[EMNLP 2023] Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models
☆17 · Oct 30, 2023 · Updated 2 years ago
wangmengsd / richpedia
Richpedia: A Comprehensive Multi-Modal Knowledge Graph
☆53 · Apr 18, 2019 · Updated 7 years ago
yangjingo / IE-Datasets-Collections
A curated collection of Chinese and English information extraction datasets
☆20 · May 15, 2022 · Updated 4 years ago
EdisonChen0816 / ner_toolkit
Named entity recognition
☆12 · Dec 21, 2020 · Updated 5 years ago
Wasim37 / marketing_text_generation
Text generation: automatically generates marketing copy from product attributes and images
☆12 · Sep 17, 2021 · Updated 4 years ago
axmand / react-webpack-redux
A map-based project built on React
☆11 · Jun 30, 2026 · Updated last week
hanjiale / HCRP
Code for the paper "Exploring Task Difficulty for Few-Shot Relation Extraction". https://arxiv.org/abs/2109.05473
☆34 · Sep 12, 2021 · Updated 4 years ago
lizongyu1293306035 / CasRelPyTorch-
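A rewritten version of the CasRelPytorch project that, building on the original source code, adds model prediction and import of the prediction results into a neo4j graph database.
☆13 · Jul 25, 2024 · Updated last year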
shigashiyama / nlp_survey
☆15 · Mar 31, 2020 · Updated 6 years ago
hccngu / Viscacha
Viscacha: a collection of universal information extraction datasets
☆27 · Feb 21, 2024 · Updated 2 years ago
yanqiuxia / Doc2EDAG
Document-level event extraction
☆22 · Sep 2, 2020 · Updated 5 years ago
zjunlp / HVPNeT
[NAACL 2022 Findings] Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extrac…
☆123 · Mar 13, 2025 · Updated last year
izuna385 / jel
Japanese Entity Linker.
☆12 · Jul 25, 2021 · Updated 4 years ago
thukg / AMinerOpen
An open source community that focuses on developing and publishing elegant algorithms, models and tools for science big data mining and kn…
☆11 · Jul 27, 2019 · Updated 6 years ago
yinzhangyue / EoT
Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication
☆21 · Mar 21, 2024 · Updated 2 years ago
izuna385 / Wikia-and-Wikipedia-EL-Dataset-Creator
You can create datasets from Wikia/Wikipedia that can be used for entity recognition and Entity Linking. Dumps for ja-wiki and VTuber-wik…
☆18 · May 2, 2021 · Updated 5 years ago
luosx18 / UED
Code and data for "An Accurate Unsupervised Method for Joint Entity Alignment and Dangling Entity Detection".
☆15 · Mar 26, 2022 · Updated 4 years ago
loujie0822 / CLUEDatasetSearch
Search all Chinese NLP datasets, with commonly used English NLP datasets included
☆14 · Mar 1, 2020 · Updated 6 years ago
DataArcTech / ChatEA
☆13 · Oct 9, 2024 · Updated last year
nocoolsandwich / iamQA
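A reading-comprehension question answering system over Chinese Wikipedia, using an NER model trained on CCKS2016 data, a CMRC2018 reading comprehension model, and W2V word-vector search; deployed with torchserve
☆90 · Jun 4, 2021 · Updated 5 years ago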
lixiang0 / WEB_KG
Crawls Chinese Baidu Baike pages, extracts triples, and builds a Chinese knowledge graph
☆957 · Jul 20, 2020 · Updated 5 years ago
pris-nlp / nlp-paper-reading-list
Motivation: a systematic collection of the papers worth reading in each area of NLP
☆34 · Oct 28, 2020 · Updated 5 years ago
mvp18 / Popular-ZSL-Algorithms
Python Implementation of Zero Shot Learning Algorithms (ALE, DeViSE, ESZSL, SAE, SJE) under ZSLGBU protocol
☆64 · May 19, 2020 · Updated 6 years ago
linsu07 / RelationExtraction
A high-performance TensorFlow implementation of relation extraction for named entities
☆12 · Oct 18, 2023 · Updated 2 years ago
acharkq / Training-Free-Graph-Matching
Source code of "Training Free Graph Neural Networks for Graph Matching"
☆12 · Jul 9, 2022 · Updated 4 years ago
wjn1996 / ChatGLM2-Tuning
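Fine-tuning based on ChatGLM2-6B, including full-parameter, parameter-efficient, and quantization-aware training, with support for instruction tuning, multi-turn dialogue fine-tuning, and more.
☆25 · Jul 29, 2023 · Updated 2 years ago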
siat-nlp / TTOS
Official repository of the EMNLP'2020 paper "Amalgamating Knowledge from Two Teachers for Task-oriented Dialogue System with Adversarial …
☆16 · Dec 9, 2021 · Updated 4 years ago
tangg555 / story-generation-demo
A simple story generation demo that fine-tunes a Hugging Face pretrained model to generate stories.
☆14 · May 12, 2023 · Updated 3 years ago
HazyResearch / tabi
Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval
☆19 · Sep 24, 2022 · Updated 3 years ago
nusnlp / MFA4RE
Code for modeling attention network for distant supervised relation extraction (CoNLL 2019).
☆15 · Feb 28, 2020 · Updated 6 years ago
MathAutoTag / mathdata
K12 high school mathematics exam question dataset
☆18 · Aug 16, 2023 · Updated 2 years ago
harrylclc / AL-CPL-dataset
Dataset for active learning for concept prerequisite learning
☆21 · May 21, 2018 · Updated 8 years ago
taishan1994 / python3_wiki_word2vec
Trains Chinese Wikipedia word vectors, character vectors, and pinyin vectors with Python 3
☆11 · Jan 2, 2022 · Updated 4 years ago