ymetz / rlhfblender
RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback
☆10Updated this week
Alternatives and similar repositories for rlhfblender:
Users that are interested in rlhfblender are comparing it to the libraries listed below
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks"☆17Updated last week
- Repo to reproduce the First-Explore paper results☆37Updated 3 weeks ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆47Updated 7 months ago
- Clean RL implementation using MLX☆28Updated 10 months ago
- Scalable Computation of Hessian Diagonals☆12Updated 7 months ago
- Lottery Ticket Adaptation☆37Updated 2 months ago
- this is for fun, ain't it grand!☆12Updated 8 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Updated 7 months ago
- ☆28Updated last month
- Structured Neural Networks☆13Updated 7 months ago
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆26Updated 4 years ago
- Official Implementation of SFM and the baselines in Jax.☆14Updated 2 months ago
- Causal Agent based on Large Language Model☆37Updated 5 months ago
- ☆25Updated 7 months ago
- The official GitHub page for the survey paper "A Survey of RWKV".☆12Updated 2 weeks ago
- Generative cellular automaton-like learning environments for RL.☆19Updated 3 months ago
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆19Updated 10 months ago
- Implementation of Spectral State Space Models☆16Updated 10 months ago
- Evaluation of neuro-symbolic engines☆34Updated 5 months ago
- ☆33Updated last year
- ☆11Updated last year
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated 3 months ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆24Updated 6 months ago
- Causal Analysis of Agent Behavior for AI Safety☆17Updated last year
- Implemenation of the HIERarchical imagionation On Structured State Space Sequence Models (HIEROS) paper☆15Updated 6 months ago
- ☆78Updated 9 months ago
- ☆15Updated last year
- We propose an evolution-based approach to meta-learn synthetic neural environments and reward neural networks for reinforcement learning.☆21Updated last year
- Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board.☆15Updated 2 years ago