UnstoppableCurry/High-quality-Chinese-Q-A-dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UnstoppableCurry/High-quality-Chinese-Q-A-dataset)

UnstoppableCurry / High-quality-Chinese-Q-A-dataset

最大开源中文问答数据集 ,助力中文LLM.The largest open-source Chinese Q&A dataset, supporting Chinese LLM

☆10

Alternatives and similar repositories for High-quality-Chinese-Q-A-dataset

Users that are interested in High-quality-Chinese-Q-A-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

XinyanLi2016 / ND-NER
View on GitHub
This is a named entity recognition (NER) dataset for OSINT towards the national defense domain.
☆10Apr 21, 2023Updated 3 years ago
menghuanlater / Tianchi2020ChineseMedicineNER
View on GitHub
2020阿里云天池大数据竞赛-中医药命名实体识别挑战赛
☆27Nov 7, 2020Updated 5 years ago
nmd2k / face-mask-detector
View on GitHub
A Face Mask detection system based You Only Look Once (YOLO) architecture deploy in-browser with Serverless Edge Computing for COVID-19
☆11Jun 1, 2021Updated 4 years ago
ashishkssingh / Anomaly-Detection-SH-ESD
View on GitHub
Anomaly Detection using SH-ESD
☆10Feb 6, 2019Updated 7 years ago
protectai / nbdefense-jupyter
View on GitHub
☆13Oct 1, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ufal / MLASK
View on GitHub
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆12Nov 7, 2023Updated 2 years ago
asd5510 / fastText-chinese-word2vec-optimization
View on GitHub
fastText中文词向量训练调优，加权融合字向量和词向量，解决过度表征字面量而非语义的问题
☆11Aug 3, 2020Updated 5 years ago
Moyouket / smartpillbox
View on GitHub
一个基于微信小程序和STM32的智能药盒管理系统，帮助用户管理用药计划、连接智能硬件设备，并提供监护人远程监护功能。
☆28Apr 29, 2025Updated last year
SXU-YaxinGuo / CRMU
View on GitHub
儿童故事常识推理与寓意理解评测（Commonsense Reasoning and Moral Understanding Evaluation in Children's Stories，CRMU）
☆18Oct 22, 2024Updated last year
macroxue / zigen
View on GitHub
Custom Chinese input method with fcitx on Linux
☆12Jul 16, 2020Updated 5 years ago
Hannibal046 / SelfMemory
View on GitHub
[Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory
☆62May 24, 2023Updated 2 years ago
MonikaVen / LLM-Prompting-RAG-Aligment-Training-Tutorial
View on GitHub
A 4-hour long tutorial session for learning to use LLMs and align them with custom data. We will also train a custom LLM.
☆17Sep 12, 2024Updated last year
tangshuang / chatglmjs
View on GitHub
ChatGLM Node.js Addon
☆11Mar 4, 2024Updated 2 years ago
lukeewin / FunASR_API
View on GitHub
这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.
☆25Feb 12, 2026Updated 2 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
hackerxiaobai / bert_multi_label_text_classification
View on GitHub
pytorch bert 版的 multi_label_text_classification
☆10Dec 28, 2019Updated 6 years ago
percent4 / Keras_R_BERT
View on GitHub
本项目使用Keras实现R-BERT，在人物关系数据集上进行测试验证。
☆10Apr 17, 2021Updated 5 years ago
CerryXu / pytorch-transformer
View on GitHub
Transformer模型的PyTorch实现
☆13Dec 30, 2019Updated 6 years ago
huangjie-nlp / CasRel
View on GitHub
Chinese entity relation extraction
☆22Apr 26, 2024Updated 2 years ago
wangyifan2018 / ChatDoc-TPU
View on GitHub
适用于sophon bm1684x，基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答
☆14Jun 5, 2024Updated last year
moronism189 / chinese-nlp-stepbystep
View on GitHub
从jieba分词到BERT-wwm，一步步带你进入中文NLP的世界
☆15Sep 1, 2022Updated 3 years ago
szemenyeim / DynEnv
View on GitHub
Dynamic Simulation Environments for Reinforcement Learning
☆13Apr 17, 2021Updated 5 years ago
labuladong / redis-3.0-annotated
View on GitHub
带有详细注释的 Redis 3.0 代码（annotated Redis 3.0 source code）。
☆11Dec 31, 2018Updated 7 years ago
morenjiujiu / Transformer_Pytorch
View on GitHub
Transformer(attention-is-all-you-need)的pytorch实现，带run demo，可以跑通
☆10Apr 16, 2019Updated 7 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
nhthang9x / HAN-Text-Classification-Pytorch
View on GitHub
My pytorch implementation of the model described in the paper **Hierarchical Attention Networks for Document Classification** [paper](htt…
☆11Mar 22, 2019Updated 7 years ago
XudongLiu / rasa_chatbot_medical
View on GitHub
the chatbot on medical based on rasa. 基于Rasa的智能聊天机器人，支持中文，面向医疗问答领域。
☆10Apr 4, 2020Updated 6 years ago
PommesPeter / Tianchi_FT-Data_Ranker
View on GitHub
FT-Data Ranker: Fine-Tuning Data Processing Competition for LLMs (1B-Model Track & 7B-Model Track) FT-Data Ranker：大语言模型微调数据竞赛 -- 1B模型赛道比赛…
☆15Dec 6, 2023Updated 2 years ago
wjjingtian / cMQA
View on GitHub
中文医疗问答数据集
☆46May 22, 2020Updated 5 years ago
shuliu586 / AI_Chinese_DataSet_KnowledgeDAO
View on GitHub
供AI训练的中文数据集（持续更新。。。）与AI公司图谱，目前的数据集餐饮行业8000问，百度知道，Alpaca中文数据集，计算机领域数据集，Vicuna数据集，RedPajama数据集，Wikipedia中文词条数据集，网站论坛问答数据集
☆65Nov 29, 2023Updated 2 years ago
dengwentao99 / SLJA
View on GitHub
☆22May 22, 2024Updated last year
Wybxc / TalkServer
View on GitHub
基于 pytorch 实现的一个聊天机器人模型，开箱即用。
☆15Aug 15, 2021Updated 4 years ago
alex000kim / ML-Pipeline-With-DVC-SkyPilot-HuggingFace
View on GitHub
☆16Sep 9, 2023Updated 2 years ago
znMemories / ChatGLM-Dataset-Maker
View on GitHub
适用于ChatGLM微调的数据集生成器, 支持多轮对话
☆15Jul 22, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yhao-wang / LLM-Knowledge-Boundary
View on GitHub
Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"
☆21Jul 31, 2023Updated 2 years ago
OS-Copilot / FRIDAY-front
View on GitHub
☆20Feb 28, 2024Updated 2 years ago
cchen-nlp / weiboNER
View on GitHub
Chinese social media (Weibo) corpus rearrangement, taking the word as the basic unit instead of character.
☆14Aug 12, 2020Updated 5 years ago
AngusMonroe / BLSTM-CRF-NER
View on GitHub
BiLSTM+CNN+CRF NER, using pytorch
☆16May 26, 2019Updated 6 years ago
llmsresearch / rearag
View on GitHub
Implementing ReaRAG, a knowledge-guided reasoning model that enhances factual accuracy using iterative retrieval-augmented generation. Ad…
☆16Feb 2, 2026Updated 3 months ago
l294265421 / multi-turn-alpaca
View on GitHub
Multi-turn alpaca is an extension of stanford alpaca and supports multi-turn dialogue 多轮对话版alpaca
☆22May 9, 2023Updated 2 years ago
theopsall / Video-Summarization
View on GitHub
Multimodal summarization of user-generated videos from wearable cameras
☆23Jun 22, 2025Updated 10 months ago