shenao-zhang/SELM

The official implementation of Self-Exploring Language Models (SELM)

☆63

Alternatives and similar repositories for SELM

Users that are interested in SELM are comparing it to the libraries listed below.

shenao-zhang / reward-augmented-preference
The official implementation of Preference Data Reward-Augmentation.
☆18 · May 1, 2025 · Updated last year
agentification / RAFA_code
☆147 · May 2, 2024 · Updated 2 years ago
jwhj / OREO
☆116 · Jan 21, 2025 · Updated last year
sail-sg / dice
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
☆47 · Apr 15, 2025 · Updated last year
shenao-zhang / BARL
Bayes-Adaptive RL for LLM Reasoning
☆45 · May 28, 2025 · Updated last year
XueruiSu / Trust-Region-Preference-Approximation
Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning
☆15 · Jun 28, 2025 · Updated last year
zkshan2002 / RTO
☆22 · Jun 4, 2025 · Updated last year
Asap7772 / understanding-rlhf
Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…
☆32 · Apr 20, 2024 · Updated 2 years ago
uclaml / SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
☆590 · Jan 23, 2025 · Updated last year
AlignInc / aligner-replication
A reproduction of the paper "Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction"
☆21 · May 29, 2024 · Updated 2 years ago
ConiferLM / Conifer
Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models
☆91 · Apr 4, 2024 · Updated 2 years ago
gsbDBI / contextual_bandits_evaluation
Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits
☆11 · Oct 21, 2024 · Updated last year
vmicheli / delta-iris
Efficient World Models with Context-Aware Tokenization. ICML 2024
☆129 · Sep 22, 2024 · Updated last year
Vance0124 / Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization (TDPO)
☆156 · Feb 14, 2025 · Updated last year
Lagooon / LeanSTaR
☆44 · Sep 19, 2024 · Updated last year
Hritikbansal / sparse_feedback
☆29 · Jan 23, 2024 · Updated 2 years ago
GXimingLu / IPA
Codebase for Inference-Time Policy Adapters
☆25 · Nov 3, 2023 · Updated 2 years ago
RLHFlow / Online-RLHF
A recipe for online RLHF and online iterative DPO.
☆544 · Dec 28, 2024 · Updated last year
LHL3341 / MetaLadder
MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)
☆12 · Apr 18, 2025 · Updated last year
mandyyyyii / east
☆19 · Aug 4, 2025 · Updated 11 months ago
austrian-code-wizard / c3po
☆30 · Apr 6, 2026 · Updated 3 months ago
RLHFlow / Self-rewarding-reasoning-LLM
Recipes to train the self-rewarding reasoning LLMs.
☆231 · Mar 2, 2025 · Updated last year
mwatkins1970 / SAE_Feature_Interpretability_Tool
A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…
☆19 · Oct 4, 2024 · Updated last year
yinyueqin / relative-preference-optimization
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts
☆26 · Feb 23, 2024 · Updated 2 years ago
kkyuhun94 / dalda
[ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
☆33 · Feb 6, 2026 · Updated 5 months ago
rosewang2008 / posr
Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings
☆34 · Nov 12, 2024 · Updated last year
chujiezheng / LLM-Extrapolation
Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"
☆75 · May 20, 2025 · Updated last year
allenai / reward-bench
RewardBench: the first evaluation tool for reward models.
☆727 · Feb 16, 2026 · Updated 5 months ago
RLHFlow / RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
☆1,536 · Apr 24, 2025 · Updated last year
wzhouad / WPO
Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"
☆41 · Sep 24, 2024 · Updated last year
InternLM / OREAL
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
☆190 · Mar 20, 2025 · Updated last year
chentong0 / copy-bench
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation
☆14 · Aug 19, 2025 · Updated 11 months ago
ai-resilience / yonsei_ml3_PriME
☆15 · Dec 16, 2025 · Updated 7 months ago
ai-resilience / yonsei_ml3_CoPe
☆15 · Dec 16, 2025 · Updated 7 months ago
itsnamgyu / block-transformer
Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)
☆166 · Apr 13, 2025 · Updated last year
ZBox1005 / CoT-UQ
[ACL 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"
☆17 · Apr 3, 2025 · Updated last year
Linear95 / SPAG
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
☆145 · Feb 24, 2025 · Updated last year
ars22 / e3
☆20 · Sep 16, 2025 · Updated 10 months ago
SALT-NLP / demonstrated-feedback
☆131 · Oct 1, 2024 · Updated last year