Interpretability dashboard for reinforcement learners
☆16Jun 4, 2019Updated 6 years ago
Alternatives and similar repositories for agent
Users that are interested in agent are comparing it to the libraries listed below.
- Training (hopefully) safe agents in gridworlds☆25May 12, 2019Updated 6 years ago
- An experimental library for metaprogramming with algebraic effects and handlers☆28Mar 17, 2026Updated last week
- Implementation of a new Quantum Oracle for solving the Max-Cut Problem with Grover Search Algorithm☆11Sep 16, 2024Updated last year
- ☆13Aug 9, 2023Updated 2 years ago
- A flat container abstraction for Rust☆16Nov 24, 2025Updated 4 months ago
- Higher Order Reverse Derivatives Efficiently - Automatic Differentiation library. See http://arxiv.org/abs/2507.12640.☆44Updated this week
- ☆13Nov 5, 2024Updated last year
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Jul 28, 2018Updated 7 years ago
- Unofficial and partial implementation of Fast AutoAugment in PyTorch☆10Oct 3, 2023Updated 2 years ago
- Help protect against malicious build scripts☆27Mar 14, 2026Updated last week
- ☆36Jan 10, 2025Updated last year
- ☆19Dec 4, 2025Updated 3 months ago
- A curated list of awesome resources for Artificial Intelligence Alignment research☆80Jul 14, 2023Updated 2 years ago
- ☆14Apr 14, 2025Updated 11 months ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆25May 15, 2019Updated 6 years ago
- Recommendation engine and its algorithms in Python and R.☆12Oct 26, 2018Updated 7 years ago
- A lightweight unified metrics library in Rust for various metrics system.☆18Nov 11, 2025Updated 4 months ago
- Library that provides environments for planning problems☆16Mar 12, 2026Updated last week
- Hands-On TensorBoard for PyTorch Developers, Published by Packt☆11Dec 15, 2025Updated 3 months ago
- ☆58Jun 15, 2022Updated 3 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 2 months ago
- ☆16Jul 29, 2024Updated last year
- Application Security Vulnerability Periodic Table☆14Aug 25, 2014Updated 11 years ago
- Generative Agent simulation of a Mastodon social network☆25Mar 16, 2026Updated last week
- The Hybrid Public Key Encryption (HPKE) standard in Python☆12Apr 29, 2024Updated last year
- ☆18Dec 15, 2025Updated 3 months ago
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago
- A gym interface for AI safety gridworlds created in pycolab.☆18May 12, 2022Updated 3 years ago
- An implementation of the Jenkins Traub polynomial root finding algorithm☆14Aug 23, 2015Updated 10 years ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated last month
- Arbitrary precision integers in TensorFlow☆11Dec 27, 2022Updated 3 years ago
- Quick and dirty data mining made easier in Sublime Text☆11Feb 24, 2021Updated 5 years ago
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- PyTorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Jan 15, 2020Updated 6 years ago
- (Model-written) LLM evals library☆18Dec 13, 2024Updated last year
- Simulated Annealing for MAX-CUT problems on {+1,-1}-weighted complete graphs☆13Feb 2, 2019Updated 7 years ago
- Connect6 (Korean: 육목) for Python.☆11May 15, 2017Updated 8 years ago