☆10Aug 17, 2022Updated 3 years ago
Alternatives and similar repositories for disc
Users that are interested in disc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Proximal Policy Option-Critic☆26Jan 4, 2019Updated 7 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Dec 1, 2019Updated 6 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.☆18May 14, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago
- ☆17Mar 21, 2021Updated 5 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆28Jul 24, 2023Updated 2 years ago
- Files from the published Alpha Star paper by DeepMind☆18Nov 14, 2019Updated 6 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- P3O paper code☆30Aug 7, 2019Updated 6 years ago
- This repository considers the implementation of the paper "FoX: Formation-aware exploration in multi-agent reinforcement learning" which …☆24Oct 24, 2024Updated last year
- ☆31Jul 1, 2019Updated 6 years ago
- Implementation of Data Efficient Reinforcement Learning in Pytorch☆20Aug 6, 2019Updated 6 years ago
- N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations☆19Sep 17, 2019Updated 6 years ago
- Digital Lens-less Holographic Microscopy plugin for ImageJ.☆13Mar 6, 2022Updated 4 years ago
- Takes input from other players champions/items, predicts what they build and calculates best team/comp for you☆22Aug 3, 2021Updated 4 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆28Feb 10, 2022Updated 4 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"☆19Sep 12, 2025Updated 7 months ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆22Oct 26, 2018Updated 7 years ago
- M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning☆28Nov 5, 2020Updated 5 years ago
- Deep Learning Project☆23Jan 18, 2020Updated 6 years ago
- Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs☆15Jun 20, 2016Updated 9 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆103Mar 24, 2023Updated 3 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Jul 31, 2020Updated 5 years ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated 9 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- ☆13May 21, 2023Updated 2 years ago
- Map-Elites based on Evolution Strategies☆33Feb 11, 2022Updated 4 years ago
- ☆13Oct 11, 2022Updated 3 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Jun 30, 2019Updated 6 years ago
- ☆17Feb 21, 2020Updated 6 years ago
- original source code of the ASE 2019 paper: Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning☆28Jun 8, 2020Updated 5 years ago