The official dataset of the paper "Goal-Oriented Prompt Attack and Safety Evaluation for LLMs".
☆21Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for CPAD
Users that are interested in CPAD are comparing it to the libraries listed below.
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- Dataset of the paper "Latent suicide risk detection on microblog via suicide-oriented word embeddings and layered attention"☆12Oct 22, 2024Updated last year
- ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety.☆93May 9, 2024Updated 2 years ago
- ☆17Mar 3, 2025Updated last year
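- Chinese financial LLM evaluation benchmark: six categories, twenty-five tasks, graded evaluation; a domestic (Chinese) model achieved grade A☆10May 6, 2024Updated 2 years ago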
- Code to conduct an embedding attack on LLMs☆32Jan 10, 2025Updated last year
- ☆23Oct 14, 2024Updated last year
- Introduction and scripts for ACL-2020 paper "On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation"☆21Jun 23, 2020Updated 5 years ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- ☆22Jan 14, 2025Updated last year
- Resources for our AAAI 2022 paper: "Unsupervised Editing for Counterfactual Stories".☆12Oct 25, 2022Updated 3 years ago
- ☆11Mar 15, 2017Updated 9 years ago
- Finding Camouflaged Needle in a Haystack? Pornographic Products Detection via Berrypicking Tree Model☆10Jul 29, 2019Updated 6 years ago
- Official repository for U-SAM (Interspeech 2025)☆27Jun 3, 2025Updated 11 months ago
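- Inspired by self-instruct: beyond general-purpose LLMs, small vertical-domain LLMs can also be built for customized results, using GPT to generate questions and answers as training data☆18May 12, 2023Updated 3 years ago
- A deep neural network model for recognizing discourse relations, mainly combining two networks, TreeLSTM and NTN: TreeLSTM produces sentence vectors, and NTN identifies the relation between two sentence vectors.☆14Mar 25, 2016Updated 10 years ago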
- BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).☆181Oct 27, 2023Updated 2 years ago
- Simple Structured Perceptron tagger in Python☆10May 30, 2017Updated 8 years ago
- ☆23Jan 29, 2026Updated 4 months ago
- ☆12Aug 16, 2018Updated 7 years ago
- Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment☆111Mar 8, 2024Updated 2 years ago
- [AAAI 2025] Enhance Vision-Language Alignment with Noise☆26Dec 19, 2024Updated last year
- hllama is a library which aims to provide a set of utility tools for large language models.☆10Apr 16, 2024Updated 2 years ago
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆17Jul 15, 2023Updated 2 years ago
- [ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks☆11May 21, 2023Updated 3 years ago
- Personal website☆15Apr 17, 2026Updated last month
- Official Code for ICLR 2023 Paper: A Message Passing Perspective on Learning Dynamics of Contrastive Learning☆11Mar 9, 2023Updated 3 years ago
- Code for ISSTA'21 paper 'Attack as Defense: Characterizing Adversarial Examples using Robustness'.☆12Sep 4, 2021Updated 4 years ago
- Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107☆21Aug 10, 2024Updated last year
- Does Refusal Training in LLMs Generalize to the Past Tense? [ICLR 2025]☆79Jan 23, 2025Updated last year
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆20Oct 2, 2024Updated last year
- [AAAI 2024] DataElixir: Purifying Poisoned Dataset to Mitigate Backdoor Attacks via Diffusion Models☆12Dec 5, 2024Updated last year
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)☆18Oct 17, 2023Updated 2 years ago
- The official GitHub repo for MixEval-X, the first any-to-any, real-world benchmark.☆17Feb 15, 2025Updated last year
- ☆48May 9, 2024Updated 2 years ago
- [ICML 2024] Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations☆15Oct 28, 2023Updated 2 years ago
- Python package to deal with PAN corpora and extract stylometric features from text documents.☆15Nov 11, 2022Updated 3 years ago
- Real-time end-to-end singing voice conversion☆25Nov 3, 2024Updated last year
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆83Mar 11, 2024Updated 2 years ago