thu-coai/COLDataset


The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection

☆351

Alternatives and similar repositories for COLDataset

Users who are interested in COLDataset are comparing it to the repositories listed below.


para-zhou / CDial-Bias
☆33 · Aug 7, 2024 · Updated last year
aggiejiang / SWSR
A new release of Chinese sexism dataset and lexicon
☆14 · May 23, 2023 · Updated 3 years ago
RXJ588 / CHSD
Hate speech corpus
☆29 · Jun 12, 2023 · Updated 3 years ago
DUT-lujunyu / ToxiCN
The code and resource of "Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmark"…
☆124 · Jun 2, 2026 · Updated last month
thu-coai / Safety-Prompts
Chinese safety prompts for evaluating and improving the safety of LLMs.
☆1,190 · Feb 27, 2024 · Updated 2 years ago
thu-coai / DiaSafety
This repo is for the paper: On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark
☆25 · Aug 13, 2022 · Updated 3 years ago
X-PLUG / CValues
Research on evaluating and aligning the values of Chinese large language models
☆560 · Jul 20, 2023 · Updated 3 years ago
thu-coai / OPD
OPD: Chinese Open-Domain Pre-trained Dialogue Model
☆73 · Jun 5, 2023 · Updated 3 years ago
thu-coai / SafetyBench
Official GitHub repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]
☆296 · Jul 28, 2025 · Updated last year
wptoux / self-instruct-zh
A Chinese self-instruct dataset built with ChatGPT
☆118 · May 16, 2023 · Updated 3 years ago
liuchengyuan123 / CPAD
The official dataset of paper "Goal-Oriented Prompt Attack and Safety Evaluation for LLMs".
☆22 · Feb 5, 2024 · Updated 2 years ago
CASIA-LM / ChineseWebText
☆186 · Nov 13, 2023 · Updated 2 years ago
microsoft / TOXIGEN
This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.
☆350 · Jun 17, 2024 · Updated 2 years ago
DUTIR-Emotion-Group / CCL2025-Chinese-Hate-Speech-Detection
☆22 · Mar 1, 2025 · Updated last year
whitzard-ai / jade-db
"Stones from other hills may serve to polish jade": a series on large-model evaluation and governance released by the Fudan JADE team
☆522 · Updated this week
JamyDon / PLM-based-CGEC-Model-Ensemble
[ACL 2023] Are Pre-trained Language Models Useful for Model Ensemble in Chinese Grammatical Error Correction?
☆10 · Dec 15, 2025 · Updated 7 months ago
Blue-Raincoat / SelectIT
☆24 · Oct 14, 2024 · Updated last year
morning-hao / domain-self-instruct
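Inspired by self-instruct: beyond general-purpose LLMs, small LLMs for vertical domains can also be built for customized results, using questions and answers obtained from GPT as training data
☆18 · May 12, 2023 · Updated 3 years ago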
XiaoMi / C3KG
☆64 · Jun 9, 2022 · Updated 4 years ago
thu-coai / ShieldLM
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings]
☆231 · Sep 29, 2024 · Updated last year
LivingFutureLab / ChineseSafetyQA
☆37 · Jan 7, 2025 · Updated last year
sww9370 / RoCBert
☆20 · Dec 26, 2022 · Updated 3 years ago
STAIR-BUPT / JailBench
JailBench: a Chinese dataset for evaluating the jailbreak attack risk of large language models [PAKDD 2025]
☆191 · Mar 3, 2025 · Updated last year
IronBeliever / CaR
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
☆91 · Nov 13, 2024 · Updated last year
thu-coai / CritiqueLLM
☆147 · Jul 1, 2024 · Updated 2 years ago
thu-coai / JailbreakDefense_GoalPriority
[ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
☆29 · Jul 9, 2024 · Updated 2 years ago
InteractiveNLP-Team / RoleLLM-public
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
☆528 · Oct 11, 2024 · Updated last year
STAIR-BUPT / SCCD
SCCD: a session-based Chinese cyberbullying detection dataset
☆24 · Mar 9, 2025 · Updated last year
HillZhang1999 / MuCGEC
Open-source release of the MuCGEC Chinese error correction dataset and SOTA text correction models; Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…
☆570 · Jun 9, 2023 · Updated 3 years ago
zjunlp / ChineseHarm-bench
ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark
☆66 · Sep 2, 2025 · Updated 10 months ago
AI45Lab / Flames
Flames is a highly adversarial Chinese benchmark for evaluating the harmlessness of LLMs, developed by Shanghai AI Lab and Fudan NLP Group.
☆68 · May 21, 2024 · Updated 2 years ago
t-davidson / hate-speech-and-offensive-language
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
☆846 · Jun 12, 2023 · Updated 3 years ago
thu-coai / LAUG
Language Understanding Augmentation Toolkit for Robustness Testing
☆20 · Jan 22, 2023 · Updated 3 years ago
limenlp / safer-instruct
This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17 · Feb 22, 2024 · Updated 2 years ago
bcwangavailable / C2D2-Cognitive-Distortion
This is for C2D2 Dataset: A Resource for Analyzing Cognitive Distortions and Its Impact on Mental Health
☆34 · Nov 10, 2023 · Updated 2 years ago
thu-coai / CDial-GPT
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
☆1,957 · Jun 12, 2023 · Updated 3 years ago
PlusLabNLP / Narrative-Discourse
☆16 · Nov 5, 2024 · Updated last year
yangjianxin1 / Firefly
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, …
☆6,649 · Oct 24, 2024 · Updated last year
esbatmop / MNBVC
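MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, targeting the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also various niche cultures and even "Martian script" (火星文) data. The MNBVC dataset includes news, essays, novels, books, magazines…
☆4,244 · Jul 13, 2026 · Updated 2 weeks ago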