[EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-based optimization framework that allows LLMs to iteratively self-improve and design the best alignment instructions without the need for additional training.
☆24Nov 17, 2024Updated last year
Alternatives and similar repositories for dynamic-alignment-optimization
Users that are interested in dynamic-alignment-optimization are comparing it to the libraries listed below
Sorting:
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆48Feb 2, 2026Updated 3 weeks ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- [COLING 2024 (Oral)] PromISe:Releasing the Capabilities of LLMs with Prompt Introspective Search☆23Aug 26, 2024Updated last year
- ☆37May 15, 2025Updated 9 months ago
- quick playground to animate pippin☆14Nov 11, 2024Updated last year
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆73Jan 16, 2026Updated last month
- Toy implementation of Strawberry☆33Sep 24, 2024Updated last year
- ☆11Jul 7, 2020Updated 5 years ago
- Repository of IPBench☆19Jan 4, 2026Updated last month
- Libraries, guides, blueprints, and sample code, to enable rapidly building 0-1 applications on iOS, Android and web.☆11May 12, 2023Updated 2 years ago
- This package is essentially a ros-wrapper of neural_cam. More features would be added in the future, geared towards mobile robot platform…☆11Jul 12, 2019Updated 6 years ago
- Prompt + regex lab☆10Nov 22, 2023Updated 2 years ago
- A functional programming library for Python☆17Dec 22, 2025Updated 2 months ago
- Failsafe value retrieval, modification and utils using json-pointer spec☆14Dec 20, 2025Updated 2 months ago
- A conditional expression compiler☆15Jun 26, 2025Updated 8 months ago
- The PyTorch implementation of paper "KERMIT: Knowledge Graph Completion of Enhanced Relation Modeling with Inverse Transformation"☆15Jul 4, 2025Updated 7 months ago
- Development repository for the Digital Terraria Lab implementation of the Sugarscape agent-based societal simulation.☆15Updated this week
- ☆14Apr 14, 2025Updated 10 months ago
- A review of class imbalanced problems using data augumentation and ensemble learning☆10Mar 15, 2023Updated 2 years ago
- A collection of OCR'd and machine-corrected Greek texts. This base repository contains Git submodules for the different works and an inve…☆11Nov 18, 2014Updated 11 years ago
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆48Jul 1, 2025Updated 7 months ago
- ☆10Jul 30, 2023Updated 2 years ago
- AMD HPC Research Fund Cloud☆17Feb 16, 2026Updated last week
- Code Repository for the EMCL-PKDD 2021 "Multitask Recalibrated Aggregation Network for Medical Code Prediction)☆13Sep 7, 2021Updated 4 years ago
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 7 months ago
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year
- Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…☆11Jun 19, 2024Updated last year
- ☆12Mar 21, 2024Updated last year
- Risk Minimization Algorithms in Structured Prediction (JMLR 2016)☆13Jan 26, 2017Updated 9 years ago
- variational free-energy of dense hydrogen☆14Sep 25, 2023Updated 2 years ago
- Python Breeding Optimizer and Simulator: A Python library for simulating and optimizing breeding pipelines.☆11Dec 10, 2024Updated last year
- ☆10May 24, 2021Updated 4 years ago
- Setup an MCP server in 60 seconds.☆13Dec 12, 2024Updated last year
- ☆11Jan 8, 2024Updated 2 years ago
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Feb 6, 2023Updated 3 years ago
- Python and Scala APIs for enhanced Spark analytics☆12Mar 15, 2017Updated 8 years ago
- Detecting Drift in a Diabetes Dataset using Taipy☆12May 19, 2025Updated 9 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year