Extract Chinese/English QA Data from WikiHow pages.
☆16May 21, 2023Updated 2 years ago
Alternatives and similar repositories for WikiHowQAExtractor-mnbvc
Users that are interested in WikiHowQAExtractor-mnbvc are comparing it to the libraries listed below
Sorting:
- parallel corpus dataset from the mnbvc project☆15Feb 11, 2026Updated 2 weeks ago
- Machine Reading Comprehension Leadboard Summary☆12Jan 4, 2021Updated 5 years ago
- ☆12Mar 22, 2025Updated 11 months ago
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.☆12Oct 12, 2024Updated last year
- ☆15Dec 20, 2024Updated last year
- Evaluating Visual Fidelity of Image Descriptions☆11Aug 15, 2019Updated 6 years ago
- python多进程爬虫+文件/SQL存储☆10Mar 7, 2022Updated 3 years ago
- MNBVC项目-ShareGPT语料清洗☆15Oct 4, 2023Updated 2 years ago
- Learning to Copy for Automatic Post-Editing (EMNLP 2019)☆11May 6, 2021Updated 4 years ago
- A dataset for multimodal machine translation☆13Dec 6, 2021Updated 4 years ago
- 收集了目前为止中文领域的MRC抽取式数据集☆122Jun 20, 2024Updated last year
- ☆12Mar 12, 2022Updated 3 years ago
- [ICLR'25 Spotlight] Rethinking and improving autoformalization: towards a faithful metric and a Dependency Retrieval-based approach☆27May 20, 2025Updated 9 months ago
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code…☆13May 25, 2025Updated 9 months ago
- ☆17Jul 18, 2022Updated 3 years ago
- 用于生成文本纠错模型(如Gector)需要的大量数据。☆14Jan 5, 2023Updated 3 years ago
- 搜索所有中文NLP数据集,附常用英文NLP数据集☆14Mar 1, 2020Updated 6 years ago
- ☆17Dec 11, 2023Updated 2 years ago
- Code of training and implementing scene attribute classifiers. Project page: http://cs.brown.edu/~gen/sunattributes.html☆21Oct 12, 2018Updated 7 years ago
- Triton Implementation of Flash Attention with Bias.☆21Apr 16, 2025Updated 10 months ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆18Dec 19, 2024Updated last year
- ☆25Jun 10, 2025Updated 8 months ago
- Pointer Networks in PyTorch☆15Nov 7, 2023Updated 2 years ago
- ☆17Nov 3, 2024Updated last year
- this repo is mnbvc text quality classification using fastText☆16Oct 2, 2023Updated 2 years ago
- ☆24Jan 14, 2021Updated 5 years ago
- Russian Artificial Text Detection☆18Nov 17, 2025Updated 3 months ago
- SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models☆26Jul 13, 2025Updated 7 months ago
- 基于BERT和MRC框架实现的嵌套命名实体识别☆19Mar 13, 2022Updated 3 years ago
- 文本去重☆78May 23, 2024Updated last year
- Repo for ACL2023 paper "Won't Get Fooled Again: Answering Questions with False Premises"☆22Jun 11, 2023Updated 2 years ago
- Python programming interface and graph walk inference engine for ConceptNet5 Web API☆26Aug 3, 2015Updated 10 years ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆23Mar 18, 2025Updated 11 months ago
- MachineLP的工具包☆18Aug 23, 2021Updated 4 years ago
- Visual Storytelling post-edit dataset☆18Sep 27, 2019Updated 6 years ago
- Rough codebase for exploring initialization strategies for new word embeddings in pretrained LMs☆19Dec 10, 2021Updated 4 years ago
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆56Jan 23, 2026Updated last month
- Code for Group-Level Emotion Recognition Using Hybrid Deep Models Based on Faces, Scenes, Skeletons and Visual Attentions☆18Nov 12, 2018Updated 7 years ago
- Official implementation for NAACL 2024 paper "HILL: Hierarchy-aware Information Lossless Contrastive Learning for Hierarchical Text Class…☆19Mar 27, 2024Updated last year