ymetz / rlhfblenderLinks
RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback
☆12Updated this week
Alternatives and similar repositories for rlhfblender
Users that are interested in rlhfblender are comparing it to the libraries listed below
Sorting:
- ☆26Updated last year
- We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effe…☆23Updated last year
- Repo to reproduce the First-Explore paper results☆37Updated 5 months ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆44Updated 5 months ago
- ☆14Updated last year
- Lottery Ticket Adaptation☆39Updated 6 months ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆30Updated 10 months ago
- ☆23Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆19Updated 2 weeks ago
- Pytorch implementation of the Gato paper from Deepmind☆12Updated 2 years ago
- ☆27Updated 2 years ago
- Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents☆49Updated 8 months ago
- Minimum Description Length probing for neural network representations☆19Updated 4 months ago
- Deep neural network architecture for representing robot experiences in an episodic-like memory which facilitates encoding, recalling, and…☆16Updated 6 years ago
- This is the official repo for Gradient Agreement Filtering (GAF).☆24Updated 4 months ago
- ☆32Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 8 months ago
- ☆19Updated last week
- ☆20Updated 2 years ago
- ☆13Updated 10 months ago
- ☆21Updated 3 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆18Updated 3 months ago
- ARLC, a probabilistic abductive reasoner for solving Raven's progressive matrices.☆18Updated last month
- Documentation for dynamic machine learning systems.☆29Updated 8 months ago
- Repo for solving arc problems with an Neural Cellular Automata☆15Updated 2 weeks ago
- Codes for Evolving Plastic ANNs☆13Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆20Updated 11 months ago
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last year