limenlp / safer-instructLinks

This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"

☆17

Alternatives and similar repositories for safer-instruct

Users that are interested in safer-instruct are comparing it to the libraries listed below

Sorting:

allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Updated last year
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆35Updated 10 months ago
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆27Updated 6 months ago
yale-nlp / refdpo
☆16Updated last year
yidingjiang / ado
The repository contains code for Adaptive Data Optimization
☆25Updated 7 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 11 months ago
open-compass / GPassK
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆28Updated 4 months ago
LAMDASZ-ML / Self-Backtracking
☆47Updated 5 months ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆44Updated last year
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆47Updated 5 months ago
martin-wey / CodeUltraFeedback
CodeUltraFeedback: aligning large language models to coding preferences
☆71Updated last year
Gen-Verse / CURE
Open-Source LLM Coders with Co-Evolving Reinforcement Learning
☆103Updated 2 weeks ago
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆30Updated 2 weeks ago
locuslab / scaling_laws_data_filtering
☆65Updated last year
yifeiwang77 / Self-Correction
☆20Updated 9 months ago
nuochenpku / LLaMA_Analysis
This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers
☆30Updated last year
googleinterns / localizing-paragraph-memorization
☆14Updated last year
casmlab / NPHardEval
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
☆57Updated last year
dinobby / MAgICoRE
☆24Updated 10 months ago
tml-epfl / icl-alignment
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆31Updated 6 months ago
csinva / tree-prompt
Tree prompting: easy-to-use scikit-learn interface for improved prompting.
☆39Updated last year
maszhongming / ParaKnowTransfer
Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"
☆32Updated last year
Fu-Dayuan / PreAct
PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)
☆28Updated 7 months ago
xhan77 / in-context-alignment
In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning
☆35Updated last year
JHU-CLSP / RATIONALYST
Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044
☆35Updated 10 months ago
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆36Updated last year
du-nlp-lab / MLR-Copilot
☆66Updated 4 months ago
allenai / super-benchmark
☆45Updated 4 months ago
hkust-nlp / B-STaR
B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
☆82Updated 2 months ago
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆42Updated last year