wanicca/WikiHowQAExtractor-mnbvc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wanicca/WikiHowQAExtractor-mnbvc)

wanicca / WikiHowQAExtractor-mnbvc

Extract Chinese/English QA Data from WikiHow pages.

☆17

Alternatives and similar repositories for WikiHowQAExtractor-mnbvc

Users that are interested in WikiHowQAExtractor-mnbvc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mnbvc-parallel-corpus-team / parallel_corpus_mnbvc
View on GitHub
parallel corpus dataset from the mnbvc project
☆17Feb 11, 2026Updated 5 months ago
pany8125 / ShareGPTQAExtractor-mnbvc
View on GitHub
MNBVC项目-ShareGPT语料清洗
☆16Oct 4, 2023Updated 2 years ago
hhu-adam / lean4monaco
View on GitHub
Browser support for Lean using a monaco editor.
☆16Jul 3, 2026Updated 2 weeks ago
zejunwang1 / gpt2ppl-zh
View on GitHub
基于中文 GPT2 预训练模型的语句困惑度计算
☆15Apr 20, 2023Updated 3 years ago
Beomi / transformers-language-modeling
View on GitHub
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3
☆23May 20, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
renxingkai / MRC_Leaderboard
View on GitHub
Machine Reading Comprehension Leadboard Summary
☆12Jan 4, 2021Updated 5 years ago
MetabrainAGI / Awaker2.5-R1
View on GitHub
☆12Mar 22, 2025Updated last year
HabanaAI / Megatron-DeepSpeed
View on GitHub
Intel Gaudi's Megatron DeepSpeed Large Language Models for training
☆18Dec 19, 2024Updated last year
Lurunchik / NF-CATS
View on GitHub
☆17Jul 18, 2022Updated 4 years ago
FudanNLP / ElasticBERT
View on GitHub
A pre-trained model with multi-exit transformer architecture.
☆56Dec 10, 2022Updated 3 years ago
lean-dojo / lean4code
View on GitHub
Lean4 Code Editor
☆17Jul 14, 2026Updated last week
wzh9969 / HPT
View on GitHub
This repository implements a prompt tuning model for hierarchical text classification. This work has been accepted as the long paper "HPT…
☆66Oct 7, 2023Updated 2 years ago
threelittlemonkeys / pointer-networks-pytorch
View on GitHub
Pointer Networks in PyTorch
☆16Nov 7, 2023Updated 2 years ago
amazon-science / pizza-semantic-parsing-dataset
View on GitHub
The PIZZA dataset continues the exploration of task-oriented parsing by introducing a new dataset for parsing pizza and drink orders, who…
☆20Dec 7, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sherlcok314159 / ChineseMRC-Data
View on GitHub
收集了目前为止中文领域的MRC抽取式数据集
☆124Jun 20, 2024Updated 2 years ago
albertwy / GPT-4V-Evaluation
View on GitHub
Data for evaluating GPT-4V
☆11Oct 26, 2023Updated 2 years ago
loujie0822 / CLUEDatasetSearch
View on GitHub
搜索所有中文NLP数据集，附常用英文NLP数据集
☆14Mar 1, 2020Updated 6 years ago
princeton-nlp / datamux-pretraining
View on GitHub
MUX-PLMs: Pretraining LMs with Data Multiplexing
☆15Jan 29, 2023Updated 3 years ago
chenpk00 / IS2024_stream_decoder_only_asr
View on GitHub
☆16Mar 12, 2024Updated 2 years ago
lvwerra / deep-math
View on GitHub
Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models"
☆30Mar 25, 2023Updated 3 years ago
lesismal / llib
View on GitHub
☆13Jul 14, 2026Updated last week
yu90892 / xiaohongshu-api
View on GitHub
小红书接口、小红书api、小红书sdk，提供首页推荐、用户信息、笔记、视频、关注、粉丝、搜索、评论等
☆15Feb 5, 2021Updated 5 years ago
gyunggyung / OpenMLLM
View on GitHub
Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?
☆19Jan 31, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
DunZhang / DFPassageRetrieve
View on GitHub
☆14Jun 20, 2022Updated 4 years ago
hanningzhang / prm
View on GitHub
☆17Nov 3, 2024Updated last year
ZINZINBIN / Tokamak-Plasma-Operation-Control-based-on-RL
View on GitHub
Tokamak plasma operation control through multi-objective reinforcement learning in KSTAR
☆25Mar 7, 2025Updated last year
genggui001 / FL-Tuning
View on GitHub
FL-Tuning
☆12Jul 11, 2022Updated 4 years ago
gaukas / socks5
View on GitHub
A sub-RFC1928 SOCKS5 server implementation in Go with zero external dependencies.
☆13Sep 5, 2023Updated 2 years ago
ustc-hyin / HiMAP
View on GitHub
Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference
☆14Jun 7, 2025Updated last year
CrazyBoyM / llama2-Chinese-chat
View on GitHub
首个llama2 13b 中文版模型（Base + 中文对话SFT，实现流畅多轮人机自然语言交互)
☆92Aug 21, 2023Updated 2 years ago
adxcreative / COPE
View on GitHub
☆15Dec 20, 2024Updated last year
alessaww / emotioNet_URLs_Download
View on GitHub
下载emotioNet_URLs的Python脚本，实现异步并行下载。
☆10Dec 22, 2017Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zhengyima / knowqa
View on GitHub
预训练模型知识量度量竞赛 Baseline F1 0.35 BERTForMaskedLM
☆13Sep 2, 2021Updated 4 years ago
Jiahao004 / DeepTheorem
View on GitHub
☆26Jun 10, 2025Updated last year
chris1111 / Disk-Speed-Test
View on GitHub
☆15Dec 25, 2020Updated 5 years ago
dqxiu / PLMs-with-Knowledge
View on GitHub
☆16Apr 11, 2022Updated 4 years ago
HKUNLP / SymGen
View on GitHub
[EMNLP'23] Code for Generating Data for Symbolic Language with Large Language Models
☆18Oct 21, 2023Updated 2 years ago
HKUNLP / ZeroGen
View on GitHub
[EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.
☆16Feb 18, 2022Updated 4 years ago
NiShuang / mobile_info_crawler
View on GitHub
ZOL中关村在线手机参数爬虫
☆12Mar 13, 2017Updated 9 years ago