vicgalle / awesome-rlaif
A curated and updated list of relevant articles and repositories on Reinforcement Learning from AI Feedback (RLAIF)
β12Updated last year
Alternatives and similar repositories for awesome-rlaif:
Users that are interested in awesome-rlaif are comparing it to the libraries listed below
- π€ A collection of AWESOME structured summaries of Large Language Models (LLMs)β26Updated last year
- Explore visualization tools for understanding Transformer-based large language models (LLMs)β9Updated 4 months ago
- ZYN: Zero-Shot Reward Models with Yes-No Questionsβ33Updated last year
- A forest of autonomous agents.β19Updated 2 months ago
- Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMsβ13Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response formatβ27Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Modelsβ¦β33Updated last year
- o1 Chain of Thought Examplesβ33Updated 5 months ago
- Reward Model framework for LLM RLHFβ61Updated last year
- Official implementation of LoT paper: "Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic"β23Updated last year
- An AI character interaction system with emotional modeling and advanced memory managementβ16Updated 5 months ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".β27Updated 7 months ago
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.β16Updated 2 weeks ago
- Public reports detailing responses to sets of prompts by Large Language Models.β30Updated 2 months ago
- Code for experiments on self-prediction as a way to measure introspection in LLMsβ12Updated 3 months ago
- Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning (Zhou et al.; EMNLP 2023 Findings)β17Updated last year
- Open Implementations of LLM Analysesβ102Updated 5 months ago
- Minimum Description Length probing for neural network representationsβ19Updated 2 months ago
- β96Updated 9 months ago
- β15Updated 6 months ago
- Knowledge transfer from high-resource to low-resource programming languages for Code LLMsβ12Updated 7 months ago
- A 7B parameter model for mathematical reasoningβ23Updated last month
- β48Updated 4 months ago
- β27Updated last week
- FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklistsβ27Updated 3 weeks ago
- β14Updated last year
- Streamlit app for recommending eval functions using prompt diffsβ27Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMsβ52Updated last year
- Multi-Modal Tree of thoughts for DALLE-3 like auto self improvementβ16Updated 4 months ago
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectivesβ67Updated last year