Interpretability dashboard for reinforcement learners
☆16Jun 4, 2019Updated 6 years ago
Alternatives and similar repositories for agent
Users that are interested in agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Training (hopefully) safe agents in gridworlds☆25May 12, 2019Updated 6 years ago
- Implementation of a new Quantum Oracle for solving the Max-Cut Problem with Grover Search Algorithm☆11Sep 16, 2024Updated last year
- Proteus 2.0☆10Updated this week
- Experiments with representation engineering☆14Feb 28, 2024Updated 2 years ago
- ☆13Aug 9, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Repo for the paper on Escalation Risks of AI systems☆44Apr 12, 2024Updated 2 years ago
- A flat container abstraction for Rust☆16Nov 24, 2025Updated 4 months ago
- Higher Order Reverse Derivatives Efficiently - Automatic Differentiation library. See http://arxiv.org/abs/2507.12640.☆44Apr 4, 2026Updated last week
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Jul 28, 2018Updated 7 years ago
- Unofficial and Partial Implementation of Fast AutoAugment in Pytorch☆10Oct 3, 2023Updated 2 years ago
- Help protect against malicious build scripts☆27Updated this week
- A curated list of awesome resources for Artificial Intelligence Alignment research☆81Jul 14, 2023Updated 2 years ago
- ☆14Apr 14, 2025Updated last year
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Recommendation engine and it's algorithms in python , R .☆12Oct 26, 2018Updated 7 years ago
- Library that provides environments for planning problems☆16Mar 30, 2026Updated 2 weeks ago
- Hands-On TensorBoard for PyTorch Developers, Published by Packt☆11Dec 15, 2025Updated 3 months ago
- ☆15May 11, 2019Updated 6 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 3 months ago
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago
- A gym interface for AI safety gridworlds created in pycolab.☆18May 12, 2022Updated 3 years ago
- Efficient wildcard matching against strings☆15May 21, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An implementation of the Jenkins Traub polynomial root finding algorithm☆14Aug 23, 2015Updated 10 years ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated 2 months ago
- Arbitrary precision integers in TensorFlow☆11Dec 27, 2022Updated 3 years ago
- uct tree search + supervised lerning for atari games☆12Feb 14, 2017Updated 9 years ago
- Gomoku AI based AlphaZero Algorithm☆10Feb 27, 2019Updated 7 years ago
- Make quick and dirty data mining made easier in Sublime Text☆11Feb 24, 2021Updated 5 years ago
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Jan 15, 2020Updated 6 years ago
- ☆20Mar 15, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A curated list of awesome AI safety papers, projects and communities.☆64Feb 9, 2020Updated 6 years ago
- Optimization as a service TUI☆11Mar 11, 2024Updated 2 years ago
- (Model-written) LLM evals library☆18Dec 13, 2024Updated last year
- Connect6 (Korean: 육목) for Python.☆11May 15, 2017Updated 8 years ago
- Simulated Annealing for MAX-CUT problems on {+1,-1}-weighted complete graphs☆13Feb 2, 2019Updated 7 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated last year
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"☆14Sep 12, 2022Updated 3 years ago