vicgalle / awesome-rlaif
A curated and updated list of relevant articles and repositories on Reinforcement Learning from AI Feedback (RLAIF)
☆12 · Updated last year
Alternatives and similar repositories for awesome-rlaif
Users who are interested in awesome-rlaif are comparing it to the repositories listed below
- A collection of AWESOME structured summaries of Large Language Models (LLMs) ☆27 · Updated last year
- Collection of Materials on AI Agents ☆41 · Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆35 · Updated last year
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale. ☆18 · Updated 2 weeks ago
- A curated list of recent efficient video generation methods. ☆18 · Updated 7 months ago
- An awesome directory of AI tools. The list here is the data source for the searchable web directory @ https://www.aitoollist.org . Discov… ☆13 · Updated 2 months ago
- All my starred repos in an awesome list format that automatically updates my stars, project descriptions and names daily via workflow and… ☆9 · Updated this week
- Code for experiments on self-prediction as a way to measure introspection in LLMs ☆15 · Updated 7 months ago
- Repository accompanying the paper https://openreview.net/pdf?id=sSAp8ITBpC ☆27 · Updated last month
- Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc) ☆20 · Updated last year
- Multi-turn RL framework for aligning models to be tutors instead of answerers. ☆14 · Updated last month
- A curated list of awesome resources for Artificial Intelligence Alignment research ☆71 · Updated 2 years ago
- Awesome list of interesting topics on Sora ☆17 · Updated this week
- Explore visualization tools for understanding Transformer-based large language models (LLMs) ☆13 · Updated 7 months ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses". ☆30 · Updated 11 months ago
- PyScrapify is a modular Python web scraper framework built on top of Selenium and BeautifulSoup. Easily extendable with new scrapers off … ☆11 · Updated last year
- Reasoning by Communicating with Agents ☆29 · Updated 2 months ago
- ☆17 · Updated 3 weeks ago
- Aioli: A unified optimization framework for language model data mixing ☆27 · Updated 6 months ago
- ☆32 · Updated last year
- Official implementation of LoT paper: "Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic" ☆26 · Updated last year
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper ☆39 · Updated 4 months ago
- OpenPipe Reinforcement Learning Experiments ☆27 · Updated 4 months ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit. ☆16 · Updated last year
- Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement ☆16 · Updated 8 months ago
- A forest of autonomous agents. ☆19 · Updated 5 months ago
- Curated list of tools, frameworks and resources to work with autonomous agents (autoGPT) ☆36 · Updated 2 years ago
- A curated list of awesome introductory programming resources for a variety of specialties within the profession. ☆12 · Updated last year
- Paper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exp… ☆14 · Updated 6 months ago
- Generative AI and Multi-Agent Networks for Creating Digital Twins (Kenyon College's Integrated Program for Humane Studies Program Fall 20… ☆12 · Updated this week