Interpretability dashboard for reinforcement learners
☆16Jun 4, 2019Updated 6 years ago
Alternatives and similar repositories for agent
Users that are interested in agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Training (hopefully) safe agents in gridworlds☆25May 12, 2019Updated 6 years ago
- An experimental library for metaprogramming with algebraic effects and handlers☆32Updated this week
- Implementation of a new Quantum Oracle for solving the Max-Cut Problem with Grover Search Algorithm☆11Sep 16, 2024Updated last year
- Experiments with representation engineering☆14Feb 28, 2024Updated 2 years ago
- Proteus 2.0☆10Apr 21, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Repo for the paper on Escalation Risks of AI systems☆44Apr 12, 2024Updated 2 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Jul 28, 2018Updated 7 years ago
- Unofficial and Partial Implementation of Fast AutoAugment in Pytorch☆10Oct 3, 2023Updated 2 years ago
- ☆36Jan 10, 2025Updated last year
- ☆19Dec 4, 2025Updated 5 months ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆25May 15, 2019Updated 6 years ago
- Recommendation engine and it's algorithms in python , R .☆12Oct 26, 2018Updated 7 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- ☆15Apr 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Library that provides environments for planning problems☆16Apr 24, 2026Updated last week
- Hands-On TensorBoard for PyTorch Developers, Published by Packt☆11Dec 15, 2025Updated 4 months ago
- ☆15May 11, 2019Updated 6 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 4 months ago
- ☆16Jul 29, 2024Updated last year
- Application Security Vulnerability Periodic Table☆14Aug 25, 2014Updated 11 years ago
- A gym interface for AI safety gridworlds created in pycolab.☆18May 12, 2022Updated 3 years ago
- An implementation of the Jenkins Traub polynomial root finding algorithm☆14Aug 23, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Fast interpolative decompositions in Python☆10Jan 4, 2021Updated 5 years ago
- uct tree search + supervised lerning for atari games☆12Feb 14, 2017Updated 9 years ago
- Gomoku AI based AlphaZero Algorithm☆10Feb 27, 2019Updated 7 years ago
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Jan 15, 2020Updated 6 years ago
- ☆20Mar 15, 2017Updated 9 years ago
- ☆25Dec 15, 2025Updated 4 months ago
- ☆12Jul 15, 2016Updated 9 years ago
- (Model-written) LLM evals library☆18Dec 13, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Simulated Annealing for MAX-CUT problems on {+1,-1}-weighted complete graphs☆13Feb 2, 2019Updated 7 years ago
- make a tunnel with two port.☆12Jan 28, 2019Updated 7 years ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"☆14Sep 12, 2022Updated 3 years ago
- RL CIRL Research☆13Dec 8, 2022Updated 3 years ago
- Pin files for contextual, codebase-level AI assistance.☆16Jul 11, 2024Updated last year
- A SFSpeechRecognizer-based voice recordings transcriber for macOS☆26Oct 31, 2022Updated 3 years ago
- trivial transparent SMTP proxy☆13Dec 6, 2022Updated 3 years ago