stable-baselines for TF2.x
☆22Jan 22, 2022Updated 4 years ago
Alternatives and similar repositories for stable-baselines-tf2
Users that are interested in stable-baselines-tf2 are comparing it to the libraries listed below
- Multiple Generalized Additive Models implemented in Python (EBM, XGB, Spline, FLAM). Code for our KDD 2021 paper "How Interpretable and T…☆13Aug 15, 2021Updated 4 years ago
- A starter template for AWS Elastic Beanstalk worker project☆14Mar 27, 2019Updated 6 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆29Feb 21, 2022Updated 4 years ago
- ☆10Dec 31, 2020Updated 5 years ago
- JAX implementation of Graph Attention Networks☆13Jan 29, 2022Updated 4 years ago
- This is a collection of link prediction tutorials that I wrote. Some notebooks are accompanied by medium blogposts that you can find here…☆11Jan 16, 2021Updated 5 years ago
- Repo to showcase solution examples and learning content curated by the advanced analytics experts within Microsoft Finance☆17Sep 2, 2022Updated 3 years ago
- ☆16May 9, 2022Updated 3 years ago
- A Julia-based machine learning package for boosting any loss, activation and constraint.☆10Jun 27, 2019Updated 6 years ago
- Study of US electrical grid. Build models for predicting demand based on weather.☆11Jul 19, 2024Updated last year
- Official implementation of "Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs"☆19May 23, 2025Updated 9 months ago
- The code for experiments conducted to verify the correctness of mirror learning.☆11Jun 3, 2022Updated 3 years ago
- Counterfactual explanations for Reinforcement Learning agents on Atari☆12Apr 3, 2023Updated 2 years ago
- ☆10Feb 20, 2024Updated 2 years ago
- ☆16Jun 25, 2022Updated 3 years ago
- European Summer School on AI Course "Machines Climbing Pearl's Ladder of Causation"☆14Feb 20, 2024Updated 2 years ago
- ☆14Jun 21, 2019Updated 6 years ago
- AAAI 2022 paper - Unifying Model Explainability and Robustness for Joint Text Classification and Rationale Extraction☆18Dec 23, 2021Updated 4 years ago
- Example Code for the Conditional Action Trees Paper☆12May 24, 2021Updated 4 years ago
- Official repository for "Construction of Hierarchical Neural Architecture Search Spaces based on Context-free Grammars" (NeurIPS 2023)☆17Oct 26, 2023Updated 2 years ago
- Data for eight zones in New England as used in the 2017 Global Energy Forecasting Competition (GEFCom2017).☆11Mar 7, 2020Updated 6 years ago
- An ultimately comprehensive paper list of sports analytics, including papers, codes, and related websites☆19Mar 2, 2023Updated 3 years ago
- QF-based Hybrid DRL Portfolio Investment System☆14Aug 13, 2023Updated 2 years ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆20Mar 22, 2022Updated 3 years ago
- Machine Learning for Trading☆14Jul 25, 2018Updated 7 years ago
- This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"☆20May 19, 2022Updated 3 years ago
- A place to anonymously review and rate courses and teachers, giving later students a reference when choosing courses.☆12Mar 20, 2018Updated 8 years ago
- Implement GANs and VAE using PyTorch☆13Mar 14, 2018Updated 8 years ago
- Solution to Cartpole balancing problem with the help of reinforcement learning and Deep Neural Networks.☆11May 5, 2023Updated 2 years ago
- A distance calculator that is able to return distance between two ports based on the derived sea route.☆14Sep 11, 2020Updated 5 years ago
- Basic PyTorch Implementation of 'Neural Architecture Search with Reinforcement Learning' (https://arxiv.org/abs/1611.01578)☆13Feb 24, 2018Updated 8 years ago
- A deep reinforcement learning model for portfolio management. For more info, check☆14Jun 2, 2020Updated 5 years ago
- [NeurIPS, 2020 - Reproducibility Challenge]: [RE] Towards Interpretable Reinforcement Learning Using Attention Augmented Agents☆13Apr 26, 2021Updated 4 years ago
- Code for my thesis project: replicating and modifying quant GANs.☆18Aug 23, 2021Updated 4 years ago
- This is a work-in-progress PyTorch implementation of the recently proposed ES-RNN by Slawek Smyl, winner of the M4 competition☆12Apr 9, 2019Updated 6 years ago
- Wasserstein GAN with gradient penalty (WGAN-GP) applied to financial time series.☆17Oct 17, 2018Updated 7 years ago
- ☆11May 3, 2019Updated 6 years ago
- Final project for CS3892 - Big Data☆13May 11, 2016Updated 9 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆15Mar 9, 2021Updated 5 years ago