RockyHHH / Safety-Evaluating
本文提出了一个基于“文心一言”的中国LLMs的安全评估基准,其中包括8种典型的安全场景和6种指令攻击类型。此外,本文还提出了安全评估的框架和过程,利用手动编写和收集开源数据的测试Prompts,以及人工干预结合利用LLM强大的评估能力作为“共同评估者”。
☆22Updated last year
Alternatives and similar repositories for Safety-Evaluating:
Users that are interested in Safety-Evaluating are comparing it to the libraries listed below
- SC-Safety: 中文大模型多轮对抗安全基准☆118Updated 10 months ago
- ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings]☆170Updated 3 months ago
- 基于ChatGPT构建的中文self-instruct数据集☆113Updated last year
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆37Updated 8 months ago
- ☆21Updated 6 months ago
- 一套代码指令微调大模型☆37Updated last year
- [EMNLP 2023 Demo] CLEVA: Chinese Language Models EVAluation Platform☆60Updated last year
- Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]☆182Updated 6 months ago
- 中文通用大模型开放域多轮测评基准 | An Open Domain Benchmark for Foundation Models in Chinese☆77Updated last year
- ☆23Updated last year
- Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…☆38Updated last year
- 通用简单工具项目☆15Updated 3 months ago
- GoGPT中文指令数据集构造☆10Updated 11 months ago
- ☆94Updated 10 months ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- 用于微调LLM的中文指令数据集☆27Updated last year
- ☆137Updated 6 months ago
- 多轮共情对话模型PICA☆87Updated last year
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆54Updated last year
- 本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作☆56Updated 3 months ago
- make LLM easier to use☆59Updated last year
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding☆48Updated last year
- Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on☆96Updated 8 months ago
- JailBench:大型语言模型越狱攻击风险评测中文数据集☆31Updated 6 months ago
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"☆64Updated 4 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 9 months ago
- 官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。☆11Updated last year
- 大语言模型训练和服务调研☆35Updated last year
- 中文原生检索增强生成测评基准