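How to convert Chinese Wikipedia data from Simplified to Traditional Chinese, extract the text content, and organize it into a JSON file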
☆19 · Aug 5, 2021 · Updated 4 years ago
Alternatives and similar repositories for Wiki_Extractor
Users who are interested in Wiki_Extractor are comparing it to the libraries listed below.
- ⚙️ Tool for NLP: handles files and text ☆15 · Feb 16, 2025 · Updated last year
- 🤖📇 Handles multiple NLP tasks in one pipeline ☆57 · Sep 18, 2025 · Updated 6 months ago
- 🍳 NLPrep: dataset tool for many natural language processing tasks ☆28 · Jul 30, 2021 · Updated 4 years ago
- A list of awesome machine question answering datasets ☆15 · Dec 24, 2019 · Updated 6 years ago
- Phraseg (一言): a new-word discovery toolkit ☆26 · Nov 30, 2021 · Updated 4 years ago
- Code and data for SCED sentence cloze dataset ☆12 · Dec 8, 2022 · Updated 3 years ago
- This data release is meant to accompany and document the paper: https://arxiv.org/abs/2004.11997 Collecting Entailment Data for Pretrain… ☆14 · Sep 29, 2020 · Updated 5 years ago
- This repository contains the code for the TextGraphs-15 paper "Modeling Graph Structure via Relative Position for Text Generation from Kn… ☆13 · Aug 10, 2021 · Updated 4 years ago
- ASR text preprocessing utility ☆21 · Aug 5, 2024 · Updated last year
- PyTorch Dataset for Kafka. ☆17 · Aug 12, 2021 · Updated 4 years ago
- This tutorial introduces readers to the basics of SIP and to setting up a SIP server ☆16 · Apr 11, 2016 · Updated 9 years ago
- Port of Kui-Namaplates to Vanilla ☆16 · Jun 15, 2025 · Updated 9 months ago
- A TensorFlow implementation of "QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension" ☆31 · Jun 2, 2018 · Updated 7 years ago
- This repository houses the IMPlicature and PRESupposition diagnostic dataset (IMPPRES), consisting of >25k semiautomatically generated se… ☆19 · Sep 15, 2021 · Updated 4 years ago
- Dynamic Connected Networks for Chinese Spelling Check ☆50 · Apr 2, 2024 · Updated 2 years ago
- Command-line debugging tool for Go programs ☆69 · Jul 24, 2013 · Updated 12 years ago
- Implement Question Generator with SOTA pre-trained Language Models (RoBERTa, BERT, GPT, BART, T5, etc.) ☆50 · Sep 20, 2022 · Updated 3 years ago
- Gradle Please Workflow for Alfred 2 ☆50 · Dec 22, 2017 · Updated 8 years ago
- Official Implementation for NYCU_TWD LT-EDI@ACL 2022 ☆20 · Feb 27, 2023 · Updated 3 years ago
- MXNet implementation of WaveNet ☆19 · Oct 20, 2016 · Updated 9 years ago
- A 30000+ Chinese MRC dataset - Delta Reading Comprehension Dataset ☆312 · Apr 21, 2020 · Updated 5 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest… ☆31 · Jul 9, 2020 · Updated 5 years ago
- Technical Analysis on Cryptocurrency ☆25 · Oct 14, 2025 · Updated 5 months ago
- Dataset and baseline for the CCL2019 "Xiaoniu Cup" (小牛杯) Chinese humor computation task ☆24 · Aug 27, 2024 · Updated last year
- Publicly available emotion training data ☆58 · Mar 7, 2023 · Updated 3 years ago
- Applications for Kubernetes ☆12 · Mar 28, 2020 · Updated 6 years ago
- An improved directory and employee search tool ☆10 · Feb 11, 2025 · Updated last year
- NILE : Natural Language Inference with Faithful Natural Language Explanations ☆29 · Jun 12, 2023 · Updated 2 years ago
- Relation extraction experiments ☆32 · May 29, 2016 · Updated 9 years ago
- This is an example of creating an AI agent with flowchart ☆12 · Jul 22, 2024 · Updated last year
- 🔬 Experimental Minio (S3) Gateway for iRODS 💾 ☆12 · Aug 13, 2019 · Updated 6 years ago
- [EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data ☆40 · Apr 7, 2022 · Updated 4 years ago
- A utility to run health checks for gRPC services ☆14 · May 20, 2024 · Updated last year
- Code for Findings of EMNLP 2022 short paper "CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model". ☆14 · May 22, 2023 · Updated 2 years ago
- Matlab implementation of TCK ☆12 · Jul 5, 2019 · Updated 6 years ago
- ☆14 · Jan 11, 2023 · Updated 3 years ago
- Blank Language Models ☆45 · Dec 31, 2020 · Updated 5 years ago
- UPC Deep Learning for Speech and Language 2018 ☆17 · Feb 26, 2018 · Updated 8 years ago
- ☆12 · Aug 12, 2022 · Updated 3 years ago