maitrix-org/dynamic-alignment-optimization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/maitrix-org/dynamic-alignment-optimization)

maitrix-org / dynamic-alignment-optimization

[EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-based optimization framework that allows LLMs to iteratively self-improve and design the best alignment instructions without the need for additional training.

☆24

Alternatives and similar repositories for dynamic-alignment-optimization

Users that are interested in dynamic-alignment-optimization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

maitrix-org / de-arena
View on GitHub
Official repository for Decentralized Arena via Collective LLM Intelligence
☆18May 19, 2025Updated last year
Hambaobao / Marathon
View on GitHub
Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.
☆10May 16, 2024Updated 2 years ago
Shentao-YANG / Preference_Grounded_Guidance
View on GitHub
Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).
☆17Jan 8, 2025Updated last year
ZNLP / Language-Imbalance-Driven-Rewarding
View on GitHub
[ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving
☆25Apr 6, 2026Updated 3 months ago
RainBowLuoCS / MMEvol
View on GitHub
(ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"
☆22May 15, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
ruizheng20 / robust_ticket
View on GitHub
Code of Robust Lottery Tickets for Pre-trained Language Models (ACL2022)
☆20Jul 18, 2022Updated 4 years ago
ychen-stat-ml / kernel-adapters
View on GitHub
Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…
☆11Feb 6, 2023Updated 3 years ago
cocoabench / cocoa-agent
View on GitHub
An agent framework for building and evaluating general digital agents.
☆41Apr 21, 2026Updated 3 months ago
connorbrinton / polyai-models
View on GitHub
☆14Sep 28, 2020Updated 5 years ago
zhenwang9102 / X-MedRELA
View on GitHub
Source Code for ACL 2020 paper, "Rationalizing Medical Relation Prediction from Corpus-level Statistics"
☆11Sep 6, 2020Updated 5 years ago
Mr-shuo / Richpedia_Website
View on GitHub
☆10Jan 11, 2023Updated 3 years ago
ack-sec / toyberry
View on GitHub
Toy implementation of Strawberry
☆33Sep 24, 2024Updated last year
Linzwcs / AFT
View on GitHub
☆13Jan 22, 2025Updated last year
dsam99 / QueRE
View on GitHub
Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".
☆12Jan 9, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yuzhaouoe / SAE-based-representation-engineering
View on GitHub
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆83Jun 20, 2026Updated last month
yuh-zha / Align
View on GitHub
Align, a general text alignment function
☆15Dec 7, 2023Updated 2 years ago
HKUST-KnowComp / PseudoReasoner
View on GitHub
Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…
☆11Oct 18, 2022Updated 3 years ago
rempsyc / starter-academic
View on GitHub
My personal site, using Wowchemy
☆13Updated this week
pcy1302 / TapEM
View on GitHub
Task-Guided Pair Embedding in Heterogeneous Network (CIKM 2019)
☆12Aug 19, 2021Updated 4 years ago
xiusic / MinPrompt
View on GitHub
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering
☆14May 3, 2024Updated 2 years ago
netpaladinx / DPMPN
View on GitHub
☆21Mar 25, 2023Updated 3 years ago
ljcleo / agent_sense
View on GitHub
Benchmarking Social Intelligence of Language Agents through Interactive Scenarios
☆13Jan 4, 2025Updated last year
cognitiveailab / tg2021task
View on GitHub
Participant Kit for the TextGraphs-15 Shared Task on Explanation Regeneration
☆18Nov 8, 2021Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
soyoung97 / AcuRank
View on GitHub
☆15Jul 30, 2025Updated 11 months ago
Yu-chen-Deng / LAPIG
View on GitHub
[TVCG & VR'25] LAPIG: Language Guided Projector Image Generation with Surface Adaptation and Stylization
☆11Apr 16, 2026Updated 3 months ago
October2001 / ProLong
View on GitHub
[ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models
☆61Jul 23, 2024Updated last year
linkedin / ControlLLM
View on GitHub
Control LLM
☆23Apr 6, 2025Updated last year
Lyun0912-wu / LongAttn
View on GitHub
LongAttn ：Selecting Long-context Training Data via Token-level Attention
☆15Jul 16, 2025Updated last year
sfeucht / footprints
View on GitHub
https://footprints.baulab.info
☆17Oct 4, 2024Updated last year
alfassy / FETA
View on GitHub
☆22Mar 14, 2024Updated 2 years ago
wangmengsd / writtingskills
View on GitHub
Some skills of English research paper writing
☆16Aug 4, 2020Updated 5 years ago
qtli / EmpDG
View on GitHub
[COLING 2020] EmpDG: Multi-resolution Interactive Empathetic Dialogue Generation
☆50Feb 16, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Ber666 / RAP
View on GitHub
Reasoning with Language Model is Planning with World Model
☆196Aug 25, 2023Updated 2 years ago
KehaoWu / pgdb
View on GitHub
psycopg2(Python中PostgreSQL连接器)的包装版，主要目的是解决psycopg2返回的是元组，这里根据列名将每一个数据变成字典和列表，方便进行操作。
☆18Jul 29, 2021Updated 4 years ago
mlwu22 / RED
View on GitHub
Implementation code for ACL2024：Advancing Parameter Efficiency in Fine-tuning via Representation Editing
☆15Apr 20, 2024Updated 2 years ago
ShunqiM / PM
View on GitHub
☆14Apr 9, 2026Updated 3 months ago
cosmoharrigan / rc-nfq
View on GitHub
RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…
☆12Mar 17, 2021Updated 5 years ago
wenge-research / CRE-SFT
View on GitHub
A supervised fine-tuning method for controllable reasoning length in large language models (一种通过有监督微调实现大语言模型思考长度可控的方法)
☆11May 8, 2025Updated last year
MozerWang / DEMO
View on GitHub
[ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling
☆22Dec 16, 2024Updated last year