zhaoshiyu/WikiExtractor

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhaoshiyu/WikiExtractor)

zhaoshiyu / WikiExtractor

维基百科离线语料获取

☆28

Alternatives and similar repositories for WikiExtractor

Users that are interested in WikiExtractor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Dielianss / Chinese-BERT-KPE
View on GitHub
☆10Apr 6, 2022Updated 4 years ago
TJYSunset / Strokes.txt
View on GitHub
汉字组件笔画数据
☆15Aug 14, 2018Updated 7 years ago
lxw0109 / CJOSpider
View on GitHub
A Spider(with and w/o Scrapy) for crawling data from China Judgements Online(中国裁判文书网).
☆21Jun 21, 2018Updated 8 years ago
SivanLaai / BaiduPinyinCrawler
View on GitHub
百度汉语字典爬虫，拼音数据，35万海量百度词典数据。
☆29Sep 5, 2022Updated 3 years ago
UnderTides / v2ray
View on GitHub
a free clash subscribe url
☆11Apr 22, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
flink-china / articles
View on GitHub
Flink 中文社区文章整理
☆13Jun 3, 2020Updated 6 years ago
pfllo / wikidata_neo4j_importer
View on GitHub
import wikidata to neo4j
☆27Jan 24, 2016Updated 10 years ago
bbende / hdf-trucking-app
View on GitHub
Example application demonstrating how to integrate all of the components of Hortonworks DataFlow.
☆14Jul 10, 2017Updated 9 years ago
YandZD / BaiduBookSuorce
View on GitHub
百度书源，可以配合饭团小说
☆12Nov 28, 2017Updated 8 years ago
NEUIR / LISRec
View on GitHub
[KDD '26] This is the code repo for our KDD '26 paper "LISRec: Modeling User Preferences with Learned Item Shortcuts for Sequential Recom…
☆18Updated this week
spandanagella / multisense
View on GitHub
☆11Dec 31, 2020Updated 5 years ago
Soappyooo / pointnet_cuda_eval
View on GitHub
UCAS国科大2024课程《GPU架构与编程》大作业1，编写pointnet的cuda推理程序。
☆20Dec 1, 2024Updated last year
ArmanHu / ErGou-WechatBot
View on GitHub
一个具有成语接龙，卜卦等功能的微信机器人----二狗
☆10Sep 7, 2022Updated 3 years ago
IrisRainbowNeko / synthesis_watermelon
View on GitHub
基于box2d物理引擎的安卓版合成大西瓜
☆16Feb 2, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
cnuc-top / subway-svg-tools
View on GitHub
地铁线路图 SVG 解析工具
☆11May 31, 2018Updated 8 years ago
hucaixue / Oral-English-8000-sentences-in-simple
View on GitHub
必背：英语口语8000句
☆14Jul 21, 2022Updated 4 years ago
nemoNoboru / annas-archive-bot
View on GitHub
telegram bot for quickly downloading from anna's archive
☆12Dec 5, 2022Updated 3 years ago
shengtaovvv / Dialogue
View on GitHub
本项目由三个模块构成。意图识别：判断用户的意图是业务型还是闲聊型；模型检索：该部分构建一个语料库，当用户发起新的query（通过意图识别判断为业务型对话）时，为用户匹配query检索的最佳response，使用HSWN进行召回（粗排），然后构建句子的相似度，并利用Lig…
☆12Feb 18, 2021Updated 5 years ago
cd74 / Meteorological_warning
View on GitHub
Use machine learning model for intense rainfall prediction. 基于深度学习的天气预报系统研究应用
☆13Nov 7, 2020Updated 5 years ago
SecurityEnthusiast / MDX-Convertor-CSV
View on GitHub
A tiny script to convert your mdx dictionary file to CSV
☆11Dec 22, 2018Updated 7 years ago
lucasjinreal / gofind
View on GitHub
gofind - your personal find helper
☆11Apr 5, 2018Updated 8 years ago
smallmarker / TextRecognition
View on GitHub
结合 MLKit 和 PreviewView，构建一个文本识别应用程序 Demo。
☆12May 25, 2023Updated 3 years ago
parlab-tuwien / lockfree-linked-list
View on GitHub
A more Pragmatic Implementation of the Lock-free, Ordered, Linked List
☆19Dec 20, 2020Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
backendbuilderdev / Django-English-Dictionary
View on GitHub
An English Dictionary for finding meaning , synonyms, antonyms of a word
☆11Jun 17, 2022Updated 4 years ago
iplaycodex / jquery-lucky-roller
View on GitHub
🏅幸运大转盘,很早之前的项目...orz...
☆11Dec 21, 2019Updated 6 years ago
Tuo-ZHANG / my-Android-dictionary-application
View on GitHub
An Android dictionary application with support for mdx format.
☆11Jan 7, 2023Updated 3 years ago
goldengrape / explanation_words_in_ebooks
View on GitHub
Automatically add explanations of unfamiliar words in ebooks
☆15Feb 9, 2023Updated 3 years ago
tedljw / rasa_test_ch
View on GitHub
基于rasa的多轮问答学习和测试
☆13May 12, 2019Updated 7 years ago
whmnoe4j / work12
View on GitHub
早期的计算机使用7位的ASCII编码，为了处理汉字，程序员设计了用于简体中文的GB2312和用于繁体中文的big5。 GB2312(1980年)一共收录了7445个字符，包括6763个汉字和682个其它符号。汉字区的内码范围高字节从B0-F7，低字节从A1-FE，占用的码…
☆10Sep 10, 2017Updated 8 years ago
gaoyangclub / gy-subway-sdk
View on GitHub
仿高德地铁线路图SDK
☆12Jun 15, 2021Updated 5 years ago
SimplGy / obsidian-open-file-by-magic-date
View on GitHub
☆11Mar 19, 2023Updated 3 years ago
liaojianqiang / word_frequency
View on GitHub
四级、六级、考研、雅思考试词频统计程序
☆10Dec 22, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
sotch-pr35mac / chinese_dictionary
View on GitHub
A searchable Chinese / English dictionary with helpful utilities.
☆12Feb 24, 2024Updated 2 years ago
Jijun / ik-analyzer
View on GitHub
基于IK中文分词器,添加同义词功能
☆13Feb 24, 2018Updated 8 years ago
shiwusong / keras_lstm_generation
View on GitHub
利用keras模仿汪峰生成歌词
☆16Jan 28, 2018Updated 8 years ago
Go-zh / blog
View on GitHub
Go 官方博客翻译
☆11Mar 16, 2019Updated 7 years ago
tianyong90 / articles
View on GitHub
日常写作汇总
☆22Mar 19, 2019Updated 7 years ago
xiayuanquan / ReadPinYinEssayDemo
View on GitHub
给朗读课文添加拼音
☆14Apr 9, 2018Updated 8 years ago
se7ven012 / ConvLSTM
View on GitHub
Typhoon Prediction
☆13Jan 10, 2020Updated 6 years ago