david-lindner/safe-grid-gym

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/david-lindner/safe-grid-gym)

david-lindner / safe-grid-gym

A gym interface for AI safety gridworlds created in pycolab.

☆18

Alternatives and similar repositories for safe-grid-gym

Users that are interested in safe-grid-gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jvmncs / safe-grid-agents
View on GitHub
Training (hopefully) safe agents in gridworlds
☆26May 12, 2019Updated 7 years ago
modanesh / anomalous_rl_envs
View on GitHub
Anomalous versions of OpenAI Gym and PyBullet3 environments
☆15Oct 24, 2021Updated 4 years ago
wangbx66 / differentially-private-q-learning
View on GitHub
☆13May 16, 2019Updated 7 years ago
zhaoyi11 / tcrl
View on GitHub
☆26Jan 26, 2024Updated 2 years ago
HiddenBeginner / Deep-Reinforcement-Learnings
View on GitHub
심층강화학습 책 https://hiddenbeginner.github.io/Deep-Reinforcement-Learnings
☆11May 10, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
harish-kamath / rqae
View on GitHub
Residual Quantization Autoencoder, used for interpreting LLMs
☆14Jan 1, 2025Updated last year
lingxuez / bayes-net
View on GitHub
Checking D-separations and I-equivalence in Bayesian Networks.
☆12Feb 11, 2017Updated 9 years ago
google-deepmind / agent_debugger
View on GitHub
Causal Analysis of Agent Behavior for AI Safety
☆21Jun 27, 2023Updated 3 years ago
XiaoxiaoGuo / rcdqn
View on GitHub
This repository contains the source code of the EMNLP 2020 paper Interactive Fiction Game Playing as Multi-Paragraph Reading Comprehensio…
☆20Oct 8, 2020Updated 5 years ago
cswinter / hyperstate
View on GitHub
Opinionated library for managing hyperparameters and mutable state of machine learning training systems.
☆19Aug 4, 2023Updated 2 years ago
hari-sikchi / DVL
View on GitHub
A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning
☆16Oct 22, 2023Updated 2 years ago
yardenas / panda-rl-kit
View on GitHub
Deploy RL on your Real-World Franka Emika Panda
☆15Feb 22, 2026Updated 5 months ago
SafeRL-Lab / Uncertainty-in-RL
View on GitHub
The repository is for Reinforcement-Learning Uncertainty research, in which we investigate various uncertain factors in RL.
☆23Jun 16, 2023Updated 3 years ago
neale / avoiding-side-effects
View on GitHub
Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments
☆12Jun 3, 2021Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
alexander-turner / attainable-utility-preservation
View on GitHub
☆11Jun 2, 2021Updated 5 years ago
gwthomas / Safe-MBPO
View on GitHub
Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"
☆52Apr 8, 2022Updated 4 years ago
amazon-science / causal-self-compatibility
View on GitHub
Code to reproduce the experiments from the paper "Self-Compatibility: Evaluating Causal Discovery without Ground Truth"
☆12Mar 9, 2024Updated 2 years ago
agentydragon / swicka
View on GitHub
Qt plotter of stock charts
☆10Mar 29, 2015Updated 11 years ago
world-modelz / dreamax
View on GitHub
A scalable Dreamer implementation in JAX
☆10May 22, 2022Updated 4 years ago
c-cube / ocaml-avro
View on GitHub
[DEPRECATED (use avro-simple)] Runtime library and schema compiler for the Avro serialization format.
☆21Jul 7, 2026Updated 3 weeks ago
andrewschreiber / agent
View on GitHub
Interpretability dashboard for reinforcement learners
☆16Jun 4, 2019Updated 7 years ago
Tim-ats-d / Macron
View on GitHub
A powerful keybind library and daemon for Linux.
☆11Jul 24, 2022Updated 4 years ago
PartnershipOnAI / safelife
View on GitHub
SafeLife: safety benchmarks for reinforcement learning agents
☆61May 13, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jan-glx / ICPy
View on GitHub
This packages provides a simple python implementation of Invariant Causal Prediction (ICP)
☆13Mar 22, 2024Updated 2 years ago
mcusi / tf_dpgmm
View on GitHub
Variational inference in Dirichlet process Gaussian mixture model (tensorflow implementation)
☆13Oct 8, 2018Updated 7 years ago
lafeychine / scala-native-sfml
View on GitHub
Scala Native 3 bindings for SFML library
☆15Jul 9, 2023Updated 3 years ago
brain-research / LeaveNoTrace
View on GitHub
Leave No Trace is an algorithm for safe reinforcement learning.
☆15Apr 30, 2018Updated 8 years ago
Breakend / SelfDestructingModels
View on GitHub
☆14Aug 9, 2023Updated 2 years ago
RockySJ / ampo
View on GitHub
☆15Oct 20, 2020Updated 5 years ago
kernelmethod / LSHFunctions.jl
View on GitHub
Locality-sensitive hashing (LSH) in Julia.
☆14Aug 31, 2021Updated 4 years ago
Nitron / pelican-alias
View on GitHub
Pelican plugin for creating alias pages (useful for moving from a different URL scheme such as /<year>/<month>/<title>/ as used by Wordpr…
☆18Aug 10, 2020Updated 5 years ago
EleutherAI / attribute
View on GitHub
☆16Nov 14, 2025Updated 8 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
oscarkey / constrained-cem-mpc
View on GitHub
An MPC algorithm which supports polytopic state and action constraints, using CEM optimisation.
☆18Oct 1, 2019Updated 6 years ago
junkwhinger / fastautoaugment_jsh
View on GitHub
Unofficial and Partial Implementation of Fast AutoAugment in Pytorch
☆10Oct 3, 2023Updated 2 years ago
CMACH508 / 2020-GNN-MCTS-TSP
View on GitHub
☆13Jun 30, 2020Updated 6 years ago
bellroy / eaforum
View on GitHub
EA Forum
☆14Nov 19, 2018Updated 7 years ago
cstjean / FProfile.jl
View on GitHub
Better profiling reports for Julia
☆14Feb 8, 2020Updated 6 years ago
cgrivera / ai-arena
View on GitHub
The AI Arena: A framework for distributed multi-agent reinforcement learning
☆14Aug 5, 2022Updated 3 years ago
uncharted-technologies / risk-and-uncertainty
View on GitHub
Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"
☆31Nov 22, 2022Updated 3 years ago