stable-baselines for TF2.x
☆22Jan 22, 2022Updated 4 years ago
Alternatives and similar repositories for stable-baselines-tf2
Users that are interested in stable-baselines-tf2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [Experimental] TensorFlow 2 version of stable-baselines, temporary repository☆45Jan 25, 2020Updated 6 years ago
- Implementation of "Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making”(ICML 202…☆14May 10, 2021Updated 5 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆29Feb 21, 2022Updated 4 years ago
- JAX implementation of Graph Attention Networks☆13Jan 29, 2022Updated 4 years ago
- Transformer(attention-is-all-you-need)的pytorch实现,带run demo,可以跑通☆10Apr 16, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A paper replication project for Time-driven feature-aware jointly deep reinforcement learning☆11Mar 12, 2021Updated 5 years ago
- Repo to showcase solution examples and learning content curated by the advanced analytics experts within Microsoft Finance☆17Sep 2, 2022Updated 3 years ago
- Go Bindings for Intel® QAT☆11Aug 12, 2024Updated last year
- A julia based machine learning package for boosting any loss, activation and constraint.☆10Jun 27, 2019Updated 6 years ago
- Official implementation of "Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs"☆21May 11, 2026Updated last month
- The code for experiments conducted to verify the correctness of mirror learning.☆11Jun 3, 2022Updated 4 years ago
- Counterfactual explanations for Reinforcement Learning agents on Atari☆12Apr 3, 2023Updated 3 years ago
- ☆10Feb 20, 2024Updated 2 years ago
- ☆15Sep 9, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- European Summer School on AI Course "Machines Climbing Pearl's Ladder of Causation"☆13Feb 20, 2024Updated 2 years ago
- ☆14Jun 21, 2019Updated 6 years ago
- Official repository for "Construction of Hierarchical Neural Architecture Search Spaces based on Context-free Grammars" (NeurIPS 2023)☆17Oct 26, 2023Updated 2 years ago
- Data for eight zones in New England as used in the 2017 Global Energy Forecasting Competition (GEFCom2017).☆12Mar 7, 2020Updated 6 years ago
- QF-based Hybrid DRL Portfolio Investment System☆14Aug 13, 2023Updated 2 years ago
- Reimplementation of ToMNet with some extensions for RL as well☆14Apr 28, 2018Updated 8 years ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆20Mar 22, 2022Updated 4 years ago
- ☆20Sep 22, 2024Updated last year
- implement GANs and VAE using pytorch☆13Mar 14, 2018Updated 8 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- Solution to Cartpole balancing problem with the help of reinforcement learning and Deep Neural Networks.☆11May 5, 2023Updated 3 years ago
- Basic PyTorch Implementation of 'Neural Architecture Search with Reinforcement Learning' (https://arxiv.org/abs/1611.01578)☆13Feb 24, 2018Updated 8 years ago
- A deep reinforcement learning model for portfolio management. For more info, check☆14Jun 2, 2020Updated 6 years ago
- Codes for my thesis project: replicating and modifying quant GANs.☆18Aug 23, 2021Updated 4 years ago
- Wasserstein GAN with gradient penalty (WGAN-GP) applied to financial time series.☆17Oct 17, 2018Updated 7 years ago
- ☆11May 3, 2019Updated 7 years ago
- Final project for CS3892 - Big Data☆13May 11, 2016Updated 10 years ago
- ☆12Jan 2, 2024Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆15Mar 9, 2021Updated 5 years ago
- ☆19Apr 22, 2024Updated 2 years ago
- Model Primitive Hierarchical Reinforcement Learning☆13Dec 8, 2022Updated 3 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆17Oct 23, 2022Updated 3 years ago
- DILMA: Differentiable Language Model Adversarial Attacks on Categorical Sequence Classifiers☆12Oct 7, 2020Updated 5 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Feb 9, 2023Updated 3 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Dec 24, 2017Updated 8 years ago