hari-sikchi / awesome-ai-safetyLinks
A curated list of awesome AI safety papers, projects and communities.
☆54Updated 5 years ago
Alternatives and similar repositories for awesome-ai-safety
Users that are interested in awesome-ai-safety are comparing it to the libraries listed below
Sorting:
- Summary of key papers in deep reinforcement learning. Heavily based on OpenAI SpinningUp.☆82Updated 5 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆84Updated 5 years ago
- Awesome RL: Papers, Books, Codes, Benchmarks☆116Updated last year
- A curated list of multiagent learning and related area resources.☆77Updated 6 years ago
- Adversarial Example Attacks on Policy Learners☆40Updated 5 years ago
- Interpreting how transformers simulate agents performing RL tasks☆87Updated last year
- Jiminy Cricket Environment (NeurIPS 2021)☆25Updated 3 years ago
- Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas☆50Updated 2 years ago
- A curated list of awesome resources for Artificial Intelligence Alignment research☆71Updated 2 years ago
- JAX library for MARL research☆88Updated last year
- ☆74Updated 2 years ago
- A collection of Reinforcement Learning GitHub code resources divided by frameworks and environments☆66Updated 3 years ago
- ☆18Updated 2 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆76Updated last year
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆30Updated last year
- Notes on many interesting RL papers☆26Updated 5 years ago
- This repository contains all code and experiments for competitive policy gradient (CoPG) algorithm.☆24Updated 5 years ago
- Object Centric Atari games☆88Updated last month
- ☆16Updated 3 years ago
- A curated list of awesome Inverse Reinforcement Learning resources.☆41Updated 3 years ago
- Pytorch starter code for UC Berkeley's cs285 assignments☆72Updated 3 years ago
- General-purpose library for extracting interpretable models from Multi-Agent Reinforcement Learning systems☆21Updated 5 years ago
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆33Updated last year
- ☆101Updated last year
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL☆24Updated 3 years ago
- Reward Learning by Simulating the Past☆45Updated 6 years ago
- Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).☆90Updated 6 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆15Updated 5 years ago
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…☆208Updated 4 years ago
- Library to compare and evaluate reward functions☆67Updated last year