ContextualAI / HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

☆890

Alternatives and similar repositories for HALOs

Users who are interested in HALOs are comparing it to the libraries listed below.

allenai / reward-bench
RewardBench: the first evaluation tool for reward models.
☆642 Updated 4 months ago
princeton-nlp / SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
☆923 Updated 8 months ago
xfactlab / orpo
Official repository for ORPO
☆463 Updated last year
uclaml / SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
☆1,206 Updated last year
tatsu-lab / alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
☆825 Updated last year
magpie-align / magpie
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …
☆782 Updated 7 months ago
huggingface / cosmopedia
☆544 Updated 11 months ago
yule-BUAA / MergeLM
Codebase for Merging Language Models (ICML 2024)
☆853 Updated last year
andyzoujm / representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
☆900 Updated last year
huggingface / Math-Verify
☆971 Updated 3 months ago
ezelikman / quiet-star
Code for Quiet-STaR
☆739 Updated last year
declare-lab / instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
☆548 Updated last year
ContextualAI / gritlm
Generative Representational Instruction Tuning
☆675 Updated 3 months ago
SinclairCoder / Instruction-Tuning-Papers
Reading list on instruction tuning, a trend that started with Natural-Instructions (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).
☆770 Updated 2 years ago
sail-sg / lorahub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
☆655 Updated last year
NVIDIA / NeMo-Aligner
Scalable toolkit for efficient model alignment
☆842 Updated 2 weeks ago
google-research / distilling-step-by-step
☆560 Updated 2 years ago
jzhang38 / EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
☆747 Updated last year
suzgunmirac / BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
☆519 Updated last year
voidism / DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
☆520 Updated 9 months ago
princeton-nlp / LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
☆631 Updated last year
glgh / awesome-llm-human-preference-datasets
A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.
☆380 Updated 2 years ago
huggingface / search-and-learn
Recipes to scale inference-time compute of open models
☆1,111 Updated 5 months ago
OpenBMB / UltraFeedback
A large-scale, fine-grained, diverse preference dataset (and models).
☆352 Updated last year
srush / awesome-o1
A bibliography and survey of the papers surrounding o1
☆1,209 Updated 11 months ago
sangmichaelxie / doremi
PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
☆341 Updated last year
GaryYufei / AlignLLMHumanSurvey
Aligning Large Language Models with Human: A Survey
☆735 Updated 2 years ago
FranxYao / Long-Context-Data-Engineering
Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context"
☆477 Updated last year
sail-sg / oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆539 Updated this week
princeton-nlp / LESS
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
☆496 Updated last year