zhu-minjun/PAlign

zhu-minjun / PAlign

Personality Alignment of Language Models

☆56

Alternatives and similar repositories for PAlign

Users who are interested in PAlign are comparing it to the repositories listed below.

zhu-minjun / SafetyLock
Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!
☆11 · Oct 16, 2024 · Updated last year
WENGSYX / ControlLM
ControlLM is a method to control the personality traits and behaviors of language models in real-time at inference without costly trainin…
☆21 · Nov 6, 2024 · Updated last year
NUSTM / CCAC-ABSA
☆10 · Jul 5, 2023 · Updated 3 years ago
MaheepChaudhary / SAE-Ravel
Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…
☆13 · Jan 26, 2025 · Updated last year
princeton-polaris-lab / Evaluating-Durable-Safeguards
[ICLR 2025] On Evaluating the Durability of Safeguards for Open-Weight LLMs
☆13 · Jun 20, 2025 · Updated last year
tanganke / subspace_fusion
Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"
☆14 · Mar 28, 2024 · Updated 2 years ago
declare-lab / safety-arithmetic
☆13 · Jan 14, 2025 · Updated last year
Li-Hyn / LLM_CatastrophicForgetting
Code for LLM_Catastrophic_Forgetting via SAM.
☆11 · Jun 7, 2024 · Updated 2 years ago
jtonglet / Numerical-Hybrid-QA-Literature
A list of numerical multimodal reasoning papers and their implementations
☆11 · May 13, 2024 · Updated 2 years ago
DYR1 / MoGU
Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.
☆18 · Jan 14, 2025 · Updated last year
bicici / SMTData
Datasets for machine translation
☆10 · Jul 5, 2019 · Updated 7 years ago
zhouhanxie / PRAG
☆12 · May 13, 2023 · Updated 3 years ago
zhengyima / DHAP
Source code of SIGIR2021 Paper 'One Chatbot Per Person: Creating Personalized Chatbots based on Implicit Profiles'
☆46 · Sep 21, 2021 · Updated 4 years ago
danielqingz / spiders
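Web crawlers for scraping Chinese-English corpora from Baidu Baike, financial reports from Eastmoney, and Chinese-English medical NER corpora; supports automatic multilingual translation via DeepL
☆13 · Jul 7, 2021 · Updated 5 years ago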
Yarkona / TOF
Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"
☆20 · Sep 28, 2022 · Updated 3 years ago
SophieZheng998 / ALI-Agent
Official implementation for "ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation"
☆21 · Jan 31, 2026 · Updated 5 months ago
ResearAI / MeOS
Fork yourself as a Skill, so agents understand you better.
☆20 · Apr 8, 2026 · Updated 3 months ago
kookeej / CORAL
Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences"
☆14 · Sep 9, 2025 · Updated 10 months ago
SophieZheng998 / RSafe
Official implementation for "RSafe: Incentivizing proactive reasoning to build robust and adaptive LLM safeguards"
☆17 · Jan 31, 2026 · Updated 5 months ago
deeplearning-wisc / picle
Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)
☆28 · Jun 27, 2024 · Updated 2 years ago
morecry / CharacterChat
Repository for CharacterChat, a personalized social support system
☆75 · Jul 13, 2024 · Updated 2 years ago
CUHK-ARISE / LLMPersonality
Code and data for the paper: On the Reliability of Psychological Scales on Large Language Models
☆31 · Dec 15, 2025 · Updated 7 months ago
InitialBug / MarCo-Dialog
Code for the ACL 2020 paper "Multi-Domain Dialogue Acts and Response Co-Generation"
☆32 · May 6, 2020 · Updated 6 years ago
weiyifan1023 / MenatQA
Code and Data for EMNLP 2023 Paper "MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Langu…
☆14 · Apr 7, 2025 · Updated last year
salavi / Clever_Hans_or_N-ToM
☆12 · May 6, 2024 · Updated 2 years ago
Willyoung2017 / PER-CHAT
Personalized Response Generation via Generative Split Memory Network
☆12 · Sep 6, 2021 · Updated 4 years ago
Jayfeather1024 / Backdoor-Enhanced-Alignment
☆24 · Dec 8, 2024 · Updated last year
dengyang17 / PPDPP
☆33 · Jan 16, 2025 · Updated last year
zhiyuanhubj / Meta-Ability-Alignment
Official code of paper "Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models"
☆88 · May 27, 2025 · Updated last year
WENGSYX / CMIVQA_Baseline
CMIVQA
☆18 · Jun 3, 2024 · Updated 2 years ago
MNoorFawi / curlora
The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.
☆53 · Aug 28, 2024 · Updated last year
wangjs9 / Muffin
Code for "Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback" (ACL 2024 Findings)
☆17 · Jul 2, 2024 · Updated 2 years ago
lifan-yuan / FactMix
Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"
☆15 · Jan 15, 2023 · Updated 3 years ago
Shawn-Guo-CN / Lossless_Text_Compression_with_Transformer
This repo demonstrates the concept of lossless compression with Transformers as encoder and decoder.
☆14 · May 2, 2024 · Updated 2 years ago
ZihanWangKi / GoalEx
Implementation of the Paper "Goal-Driven Explainable Clustering via Language Descriptions"
☆41 · May 24, 2023 · Updated 3 years ago
cooperleong00 / ToxificationReversal
Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)
☆18 · Oct 17, 2023 · Updated 2 years ago
sunlab-osu / ReasonBERT
Code and pre-trained models for "ReasonBERT: Pre-trained to Reason with Distant Supervision", EMNLP 2021
☆28 · Feb 1, 2023 · Updated 3 years ago
sail-sg / AnytimeReasoner
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
☆54 · Jul 15, 2025 · Updated last year
BangLiu / SentenceMatching
Matching Natural Language Sentences with Hierarchical Sentence Factorization
☆22 · Apr 26, 2018 · Updated 8 years ago