Interpretability dashboard for reinforcement learners
☆16Jun 4, 2019Updated 6 years ago
Alternatives and similar repositories for agent
Users that are interested in agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Training (hopefully) safe agents in gridworlds☆25May 12, 2019Updated 7 years ago
- This project includes various scripts for Ensage.☆11Jan 5, 2015Updated 11 years ago
- An experimental library for metaprogramming with algebraic effects and handlers☆41Updated this week
- Implementation of a new Quantum Oracle for solving the Max-Cut Problem with Grover Search Algorithm☆11Sep 16, 2024Updated last year
- This repository contains the dataset and code for our ACL'23 publication: "MatSci-NLP: Evaluating Scientific Language Models on Materials…☆17Nov 21, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆14Aug 9, 2023Updated 2 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Jul 28, 2018Updated 7 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆25May 15, 2019Updated 7 years ago
- Recommendation engine and it's algorithms in python , R .☆12Oct 26, 2018Updated 7 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- Mac port of Torcs, The Open Racing Car Simulator☆11Jun 16, 2010Updated 15 years ago
- Library that provides environments for planning problems☆16Apr 24, 2026Updated last month
- ☆15Apr 14, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 4 months ago
- Application Security Vulnerability Periodic Table☆14Aug 25, 2014Updated 11 years ago
- Deep Reinforcement Learning☆17Sep 1, 2017Updated 8 years ago
- Generative Agent simulation of a Mastodon social network☆26May 7, 2026Updated 2 weeks ago
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago
- A gym interface for AI safety gridworlds created in pycolab.☆18May 12, 2022Updated 4 years ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated 3 months ago
- An implementation of the Jenkins Traub polynomial root finding algorithm☆14Aug 23, 2015Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Gomoku AI based AlphaZero Algorithm☆10Feb 27, 2019Updated 7 years ago
- Make quick and dirty data mining made easier in Sublime Text☆11Feb 24, 2021Updated 5 years ago
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Jan 15, 2020Updated 6 years ago
- ☆20Mar 15, 2017Updated 9 years ago
- ☆26Dec 15, 2025Updated 5 months ago
- ☆30May 1, 2019Updated 7 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Connect6 (Korean: 육목) for Python.☆11May 15, 2017Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- C#的GUI五子棋大作业 包括禁手 AI 简单直播功能☆10Dec 14, 2018Updated 7 years ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"☆14Sep 12, 2022Updated 3 years ago
- ☆10Apr 27, 2022Updated 4 years ago
- Pin files for contextual, codebase-level AI assistance.☆16Jul 11, 2024Updated last year
- A SFSpeechRecognizer-based voice recordings transcriber for macOS☆26Oct 31, 2022Updated 3 years ago
- The repo for Shen Group's FMAB repo☆11Jan 21, 2021Updated 5 years ago
- Watches for change in your maildir, and runs mbsync when change are found.☆18Jul 23, 2014Updated 11 years ago