liuchengyuan123/CPAD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/liuchengyuan123/CPAD)

liuchengyuan123 / CPAD

The official dataset of paper "Goal-Oriented Prompt Attack and Safety Evaluation for LLMs".

☆22

Alternatives and similar repositories for CPAD

Users that are interested in CPAD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chenzijie156 / DeepSeek_R1_Medical_Cot
View on GitHub
☆13Mar 8, 2025Updated last year
limenlp / safer-instruct
View on GitHub
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Feb 22, 2024Updated 2 years ago
vinid / safety-tuned-llamas
View on GitHub
ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety.
☆95May 9, 2024Updated 2 years ago
xu1998hz / SEScore2
View on GitHub
☆17Mar 3, 2025Updated last year
bryant03 / Sina-Weibo-Dataset
View on GitHub
Dataset of the paper:“Latent suicide risk detection on microblog via suicide-oriented word embeddings and layered attention
☆12Oct 22, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
NijiaLu / BullyDataset
View on GitHub
a new Chinese Weibo comments dataset collected from Sina Weibo comment specifically for cyberbullying detection
☆16Aug 29, 2019Updated 6 years ago
SchwinnL / LLM_Embedding_Attack
View on GitHub
Code to conduct an embedding attack on LLMs
☆33Jan 10, 2025Updated last year
chaojun-wang / Exposure-Bias-Hallucination-Domain-Shift
View on GitHub
Introduction and scripts for ACL-2020 paper "On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation"
☆21Jun 23, 2020Updated 6 years ago
timvieira / vocrf
View on GitHub
Variable-order CRFs with structure learning
☆17Aug 1, 2024Updated last year
NeuralSentinel / SafeInfer
View on GitHub
☆23Jan 14, 2025Updated last year
jiangjiechen / EDUCAT
View on GitHub
Resources for our AAAI 2022 paper: "Unsupervised Editing for Counterfactual Stories".
☆13Oct 25, 2022Updated 3 years ago
morning-hao / domain-self-instruct
View on GitHub
受到self-instruct启发,除了通用LLM还能做垂直领域的小LLM实现定制效果，通过GPT获得question和answer来作为训练数据
☆18May 12, 2023Updated 3 years ago
JohnZhengHub / DRM
View on GitHub
用深度神经网络识别语篇关系的模型，主要结合了ＴreeLSTM和NTN两种神经网络，用TreeLSTM来获得句子向量，NTN来识别两个句子向量之间的关系.
☆14Mar 25, 2016Updated 10 years ago
bplank / sp
View on GitHub
Simple Structured Perceptron tagger in Python
☆10May 30, 2017Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
abccaba2000 / discourse-parser
View on GitHub
☆12Aug 16, 2018Updated 7 years ago
karlhigley / ranking-metrics-torch
View on GitHub
Simple ranking metrics for PyTorch on CPU or GPU
☆15Nov 20, 2020Updated 5 years ago
declare-lab / red-instruct
View on GitHub
Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment
☆111Mar 8, 2024Updated 2 years ago
MahdeenSky / SoftVC-VITS-MusicSingerChanger
View on GitHub
Google collab for testing SoftVC VITS Singing Voice Conversion for AI capable of changing the singer within music files.
☆13Apr 21, 2023Updated 3 years ago
liuquncn / liuquncn.github.io
View on GitHub
Personal website
☆15Jun 18, 2026Updated last month
deep-diver / hllama
View on GitHub
hllama is a library which aims to provide a set of utility tools for large language models.
☆10Apr 16, 2024Updated 2 years ago
grasses / PoisonPrompt
View on GitHub
Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107
☆21Aug 10, 2024Updated last year
pillowsofwind / Course-Correction
View on GitHub
[EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"
☆20Oct 2, 2024Updated last year
RunpeiDong / DGMS
View on GitHub
[ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks
☆11May 21, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
YisenWang / ONL
View on GitHub
Code for CVPR2018 "Iterative Learning with Open-set Noisy Labels"
☆12Mar 12, 2021Updated 5 years ago
tml-epfl / llm-past-tense
View on GitHub
Does Refusal Training in LLMs Generalize to the Past Tense? [ICLR 2025]
☆78Jan 23, 2025Updated last year
PKU-ML / Message-Passing-Contrastive-Learning
View on GitHub
Official Code for ICLR 2023 Paper: A Message Passing Perspective on Learning Dynamics of Contrastive Learning
☆11Mar 9, 2023Updated 3 years ago
Blue-Raincoat / SelectIT
View on GitHub
☆24Oct 14, 2024Updated last year
L-Maybe / SemEval-2019-task3-EmoContext
View on GitHub
dawei.li SemEval-2019 task3 EmoContext: Multi-Step Ensemble Neural Network for Sentiment Analysis in Textual Conversation
☆16Jun 6, 2019Updated 7 years ago
mullerpeter / authorstyle
View on GitHub
Python package to deal with PAN corpora and extract stylometric features from text documents.
☆15Nov 11, 2022Updated 3 years ago
Babelscape / ALERT
View on GitHub
Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"
☆60Sep 20, 2024Updated last year
cooperleong00 / ToxificationReversal
View on GitHub
Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)
☆18Oct 17, 2023Updated 2 years ago
Aatrox103 / SAP
View on GitHub
☆49May 9, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
epfl-dlab / llm-latent-language
View on GitHub
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
☆87Mar 11, 2024Updated 2 years ago
Manu21JC / DataElixir
View on GitHub
[AAAI 2024] DataElixir: Purifying Poisoned Dataset to Mitigate Backdoor Attacks via Diffusion Models
☆12Dec 5, 2024Updated last year
RockyHHH / Safety-Evaluating
View on GitHub
本文提出了一个基于“文心一言”的中国LLMs的安全评估基准，其中包括8种典型的安全场景和6种指令攻击类型。此外，本文还提出了安全评估的框架和过程，利用手动编写和收集开源数据的测试Prompts，以及人工干预结合利用LLM强大的评估能力作为“共同评估者”。
☆35Sep 1, 2023Updated 2 years ago
tan92hl / Complex-Question-Answering-Evaluation-of-GPT-family
View on GitHub
A large-scale complex question answering evaluation of ChatGPT and similar large-language models
☆39Apr 23, 2024Updated 2 years ago
affjljoo3581 / polyglot-jax-inference
View on GitHub
TPU에서 한국어용 LLM 추론을 위한 Jax/Flax 구현체입니다.
☆12Jun 12, 2023Updated 3 years ago
TIGER-AI-Lab / TIGERScore
View on GitHub
"TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks" [TMLR 2024]
☆32Dec 21, 2024Updated last year
Kagami / docker_cve-2015-2925
View on GitHub
Docker + CVE-2015-2925 = escaping from --volume
☆11Jun 30, 2015Updated 11 years ago