GPaolo / novelty_search_gym

A (fairly modular and easily expandable) novelty search implementation for gym-based environments

☆12

Alternatives and similar repositories for novelty_search_gym

Users who are interested in novelty_search_gym are comparing it to the libraries listed below

btnorman / First-Explore
Repo to reproduce the First-Explore paper results
☆37 Updated 6 months ago
Sea-Snell / CALM-Dialogue
Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"
☆34 Updated 2 years ago
google-deepmind / enn_acme
☆31 Updated 2 years ago
Farama-Foundation / CrowdPlay
A web-based platform for collecting human actions in reinforcement learning environments
☆30 Updated last year
eilab-gt / NovGrid
Novelty MiniGrid (NovGrid) is an extension of the MiniGrid environment that allows for the world properties and dynamics to change according …
☆35 Updated last year
google-deepmind / zipfian_environments
☆28 Updated 2 years ago
iPieter / universal-distillation
🧪Create domain-adapted language models by distilling from many pre-trained LMs
☆10 Updated 2 years ago
codingfisch / flashrl
Fast reinforcement learning 💨
☆25 Updated 3 months ago
google-deepmind / lm_act
LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations
☆18 Updated last month
google-deepmind / csuite
☆44 Updated 9 months ago
google-deepmind / agent_debugger
Causal Analysis of Agent Behavior for AI Safety
☆18 Updated 2 years ago
facebookresearch / cascade
Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).
☆29 Updated 2 years ago
AutonomousAgentsLab / cr-dv3
DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…
☆36 Updated last year
cassidylaidlaw / orpo
☆16 Updated 7 months ago
DHDev0 / Muzero-unplugged
PyTorch implementation of MuZero Unplugged for gym environments. This algorithm is capable of supporting a wide range of action and observ…
☆27 Updated 2 years ago
conglu1997 / intelligent-go-explore
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
☆58 Updated 4 months ago
NVlabs / gbrl_sb3
GBRL-based Actor-Critic algorithms implemented in stable-baselines3
☆35 Updated last month
upiterbarg / lintseq
[ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)
☆19 Updated 4 months ago
luchris429 / discovered-policy-optimisation
Code for Discovered Policy Optimisation (NeurIPS 2022)
☆11 Updated 2 years ago
jbloomAus / DecisionTransformerInterpretability
Interpreting how transformers simulate agents performing RL tasks
☆85 Updated last year
facebookresearch / oni
Learn online intrinsic rewards from LLM feedback
☆41 Updated 6 months ago
google-deepmind / emergent_communication_at_scale
☆38 Updated 10 months ago
DavidRother / cooking_zoo
CookingZoo: a gym-cooking derivative to simulate a complex cooking environment
☆20 Updated 6 months ago
EleutherAGI / summarisation
The intermediate goal of the project is to train a GPT-like architecture to learn to summarise Reddit posts from human preferences, as th…
☆12 Updated 3 years ago
yuqingd / cusp
☆15 Updated 2 years ago
keraJLi / synthetic-gymnax
Drop-in environment replacements that make your RL algorithm train faster.
☆21 Updated last year
microsoft / strategically_efficient_rl
More efficient exploration for reinforcement learning in two-player, zero-sum games
☆21 Updated 10 months ago
ThomasMiconi / Meta-Task-Generator
Automatically generate simple meta-learning tasks from a very large space
☆15 Updated last year
prajjwal1 / rl_paradigm
☆17 Updated last year
ml-jku / LRAM
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
☆33 Updated 8 months ago