Extract Chinese/English QA Data from WikiHow pages.
☆16May 21, 2023Updated 2 years ago
Alternatives and similar repositories for WikiHowQAExtractor-mnbvc
Users that are interested in WikiHowQAExtractor-mnbvc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MNBVC项目-ShareGPT语料清洗☆16Oct 4, 2023Updated 2 years ago
- 基于中文 GPT2 预训练模型的语句困惑度计算☆15Apr 20, 2023Updated 3 years ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆18Dec 19, 2024Updated last year
- ☆17Jul 18, 2022Updated 3 years ago
- A Python toolkit for file processing, text cleaning and data splitting. 文件处理,文本清洗和数据划分的python工具包。☆36Oct 18, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆22Jul 15, 2024Updated last year
- API to extract data from wikiHow☆18Jul 10, 2021Updated 4 years ago
- ☆25Jun 10, 2025Updated 10 months ago
- 用于生成文本纠错模型(如Gector)需要的大量数据。☆14Jan 5, 2023Updated 3 years ago
- ChatGPT-JueJin是一款基于ChatGPT的web应用聊天软件,本项目为ChatGPT-JueJin应用的Java后端☆10Jun 20, 2023Updated 2 years ago
- 一个为RAG系统设计的Markdown文档工具,提供标题结构自动抽取和文档分割两大功能。完整保留文档层级结构,解决传统切分器丢失标题层级与破坏表格完整性的问题。A hierarchy-preserving Markdown document splitter for RAG…☆13Jan 2, 2025Updated last year
- Pointer Networks in PyTorch☆16Nov 7, 2023Updated 2 years ago
- 搜索所有中文NLP数据集,附常用英文NLP数据集☆14Mar 1, 2020Updated 6 years ago
- The PIZZA dataset continues the exploration of task-oriented parsing by introducing a new dataset for parsing pizza and drink orders, who…☆20Dec 7, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 基于BERT和MRC框架实现的嵌套命名实体识别☆19Mar 13, 2022Updated 4 years ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆42Apr 4, 2025Updated last year
- ☆17Nov 3, 2024Updated last year
- ☆29Sep 29, 2024Updated last year
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.☆14Oct 12, 2024Updated last year
- 下载emotioNet_URLs的Python脚本,实现异步并行下载。☆10Dec 22, 2017Updated 8 years ago
- 首个llama2 13b 中文版模型 (Base + 中文对话SFT,实现流畅多轮人机自然语言交互)☆91Aug 21, 2023Updated 2 years ago
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos☆27Aug 8, 2025Updated 8 months ago
- Repo for ACL2023 paper "Won't Get Fooled Again: Answering Questions with False Premises"☆22Jun 11, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Evaluating Visual Fidelity of Image Descriptions☆11Aug 15, 2019Updated 6 years ago
- 基于openai框架编写的辅助元提示词生成器☆19Jul 19, 2024Updated last year
- this repo is mnbvc text quality classification using fastText☆16Oct 2, 2023Updated 2 years ago
- CCL2022 领域问答库构建测评☆20Oct 31, 2022Updated 3 years ago
- Rough codebase for exploring initialization strategies for new word embeddings in pretrained LMs☆19Dec 10, 2021Updated 4 years ago
- 支持ChatGLM2 lora微调☆41Jul 11, 2023Updated 2 years ago
- A general purpose leaderboard for small ML competitions using Streamlit☆19Oct 29, 2020Updated 5 years ago
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code…☆13May 25, 2025Updated 11 months ago
- ☆12Mar 12, 2022Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Download Bilibili video in golang☆14Jul 29, 2024Updated last year
- ☆18Mar 3, 2023Updated 3 years ago
- Code of training and implementing scene attribute classifiers. Project page: http://cs.brown.edu/~gen/sunattributes.html☆21Oct 12, 2018Updated 7 years ago
- Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models☆26May 31, 2025Updated 11 months ago
- A dataset for multimodal machine translation☆13Dec 6, 2021Updated 4 years ago
- 文本去重☆78May 23, 2024Updated last year
- The official repository for the paper Multilingual Mathematical Autoformalization☆38May 20, 2024Updated last year