☆10Aug 17, 2022Updated 3 years ago
Alternatives and similar repositories for disc
Users that are interested in disc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Proximal Policy Option-Critic☆26Jan 4, 2019Updated 7 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Dec 1, 2019Updated 6 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.☆18May 14, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago
- ☆17Mar 21, 2021Updated 5 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆28Jul 24, 2023Updated 2 years ago
- Files from the published Alpha Star paper by DeepMind☆18Nov 14, 2019Updated 6 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- P3O paper code☆30Aug 7, 2019Updated 6 years ago
- This repository considers the implementation of the paper "FoX: Formation-aware exploration in multi-agent reinforcement learning" which …☆23Oct 24, 2024Updated last year
- Digital Lens-less Holographic Microscopy plugin for ImageJ.☆12Mar 6, 2022Updated 4 years ago
- ☆31Jul 1, 2019Updated 6 years ago
- Implementation of Data Efficient Reinforcement Learning in Pytorch☆20Aug 6, 2019Updated 6 years ago
- N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations☆19Sep 17, 2019Updated 6 years ago
- Takes input from other players champions/items, predicts what they build and calculates best team/comp for you☆21Aug 3, 2021Updated 4 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆28Feb 10, 2022Updated 4 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"☆19Sep 12, 2025Updated 6 months ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆22Oct 26, 2018Updated 7 years ago
- TOKEN-IMPORTANCE GUIDED DIRECT PREFERENCE OPTIMIZATION☆24Jan 26, 2026Updated 2 months ago
- M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning☆28Nov 5, 2020Updated 5 years ago
- Deep Learning Project☆23Jan 18, 2020Updated 6 years ago
- Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs☆15Jun 20, 2016Updated 9 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆103Mar 24, 2023Updated 3 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Jul 31, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated 9 months ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- ☆13May 21, 2023Updated 2 years ago
- Map-Elites based on Evolution Strategies☆33Feb 11, 2022Updated 4 years ago
- ☆13Oct 11, 2022Updated 3 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Jun 30, 2019Updated 6 years ago