Interpretability dashboard for reinforcement learners
☆16Jun 4, 2019Updated 7 years ago
Alternatives and similar repositories for agent
Users that are interested in agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Training (hopefully) safe agents in gridworlds☆26May 12, 2019Updated 7 years ago
- An experimental library for metaprogramming with algebraic effects and handlers☆43Jun 25, 2026Updated last week
- Implementation of a new Quantum Oracle for solving the Max-Cut Problem with Grover Search Algorithm☆11Sep 16, 2024Updated last year
- This repository contains the dataset and code for our ACL'23 publication: "MatSci-NLP: Evaluating Scientific Language Models on Materials…☆17Nov 21, 2023Updated 2 years ago
- Proteus 2.0☆10Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Dec 10, 2017Updated 8 years ago
- Higher Order Reverse Derivatives Efficiently - Automatic Differentiation library. See http://arxiv.org/abs/2507.12640.☆45Apr 29, 2026Updated 2 months ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Jul 28, 2018Updated 7 years ago
- ☆35Jan 10, 2025Updated last year
- ☆19Dec 4, 2025Updated 7 months ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Code from http://www.ark.cs.cmu.edu/mheilman/questions/☆12Apr 23, 2013Updated 13 years ago
- ☆44Feb 18, 2026Updated 4 months ago
- Mac port of Torcs, The Open Racing Car Simulator☆11Jun 16, 2010Updated 16 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Library that provides environments for planning problems☆17Apr 24, 2026Updated 2 months ago
- Hands-On TensorBoard for PyTorch Developers, Published by Packt☆11Dec 15, 2025Updated 6 months ago
- ☆15May 11, 2019Updated 7 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 6 months ago
- Application Security Vulnerability Periodic Table☆14Aug 25, 2014Updated 11 years ago
- Deep Reinforcement Learning☆17Sep 1, 2017Updated 8 years ago
- Silicon Society Sandbox (SiliSocS) is an versatile and extensible experimentation system for EASE-configured generative multi-agent simul…☆29Jun 27, 2026Updated last week
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Accelerated Methods for Deep Reinforcement Learning☆49Mar 20, 2019Updated 7 years ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆11Feb 7, 2026Updated 4 months ago
- Efficient wildcard matching against strings☆15May 21, 2024Updated 2 years ago
- Arbitrary precision integers in TensorFlow☆11Dec 27, 2022Updated 3 years ago
- Fast interpolative decompositions in Python☆10Jan 4, 2021Updated 5 years ago
- Gomoku AI based AlphaZero Algorithm☆10Feb 27, 2019Updated 7 years ago
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- The Cape Privacy Python SDK☆23Oct 5, 2023Updated 2 years ago
- Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Jan 15, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆28Dec 15, 2025Updated 6 months ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- ☆12Jul 15, 2016Updated 9 years ago
- Simulated Annealing for MAX-CUT problems on {+1,-1}-weighted complete graphs☆13Feb 2, 2019Updated 7 years ago
- Connect6 (Korean: 육목) for Python.☆11May 15, 2017Updated 9 years ago
- grid world reinforcement learning for tensorflow js☆20Jun 18, 2018Updated 8 years ago
- make a tunnel with two port.☆12Jan 28, 2019Updated 7 years ago