SafeLife: safety benchmarks for reinforcement learning agents
☆61May 13, 2021Updated 4 years ago
Alternatives and similar repositories for safelife
Users that are interested in safelife are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆84Dec 13, 2019Updated 6 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Jun 3, 2021Updated 4 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Jul 28, 2018Updated 7 years ago
- Pin files for contextual, codebase-level AI assistance.☆16Jul 11, 2024Updated last year
- Exploring AI, leveraging the power of Python.☆29Feb 6, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- ☆11Jun 2, 2021Updated 4 years ago
- Training (hopefully) safe agents in gridworlds☆25May 12, 2019Updated 6 years ago
- Python Library for Function Approximation in Machine Learning☆12Nov 5, 2019Updated 6 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆25Jun 20, 2021Updated 4 years ago
- 🔥 A repository for collecting cyberdefense thoughts, books, and documents about AI cyberdefense☆13Jul 2, 2023Updated 2 years ago
- Google Tink's critical Ed25519 bug related to Java "final" keyword☆11Apr 5, 2020Updated 6 years ago
- A gym interface for AI safety gridworlds created in pycolab.☆18May 12, 2022Updated 3 years ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆27Jan 23, 2022Updated 4 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆15Feb 4, 2020Updated 6 years ago
- Neural Arithmetic Logic Units(arXiv:1808.00508)☆11Aug 6, 2018Updated 7 years ago
- ☆85Nov 19, 2020Updated 5 years ago
- Easy Setup, File-based, Offline Capable Federated Learning and Computations☆22Mar 28, 2026Updated last month
- This is a suite of reinforcement learning environments illustrating various safety properties of intelligent agents.☆633May 18, 2022Updated 3 years ago
- Read fixed width data files with Python 3☆14Mar 20, 2026Updated last month
- A small bookmarks app for Solid☆11Jul 13, 2017Updated 8 years ago
- Gymnasium environment for reinforcement learning with multicopters☆32Jun 4, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- The source code for the gym-microrts paper.☆42Aug 5, 2022Updated 3 years ago
- Web effectivethesis.com (and old version of efektivni-altruismus.cz)☆10Feb 22, 2022Updated 4 years ago
- Several variations of a dot product benchmark.☆11Dec 10, 2012Updated 13 years ago
- Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch☆27Jan 16, 2020Updated 6 years ago
- Proof of concept on a predictive maintenance use case using federated learning to continuously improve predictions of the remaining life…☆11Feb 21, 2020Updated 6 years ago
- Jupyter notebooks introducing Twitter OSINT with TWINT☆19Jul 19, 2023Updated 2 years ago
- The Happy Faces Benchmark☆15Jul 20, 2023Updated 2 years ago
- ☆10Feb 15, 2017Updated 9 years ago
- A distributed network based on hash codes and lattices.☆14Aug 16, 2016Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆22Sep 9, 2021Updated 4 years ago
- Plot charts from arbtt-stats to terminal☆17Jun 16, 2024Updated last year
- A concise primer on Differential Privacy☆29Jun 24, 2020Updated 5 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Jun 5, 2019Updated 6 years ago
- 📔️ Generate a text-based journal from a template file.☆21Mar 16, 2021Updated 5 years ago
- Slides for an opinionated talk about what it means to be a senior software engineer☆15Jun 17, 2023Updated 2 years ago
- Made for a reading group at the Center for Safe AGI.☆12Feb 23, 2026Updated 2 months ago