FarnazAdib / Crash_course_on_RL
This is a self-contained repository that explains two basic Reinforcement Learning (RL) algorithms.
☆83Sep 19, 2024Updated last year
Alternatives and similar repositories for Crash_course_on_RL
Users who are interested in Crash_course_on_RL are comparing it to the repositories listed below.
- Code for the paper {Pang, Bo, and Zhong-Ping Jiang. "Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic …☆29Dec 5, 2021Updated 4 years ago
- ☆51Mar 3, 2024Updated last year
- OpenControl is a python package that implements basic algorithms for the analysis and design of optimal feedback controllers.☆15Jul 16, 2021Updated 4 years ago
- EMNLP 2020: Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots☆12Dec 15, 2020Updated 5 years ago
- This repository contains the code for the TextGraphs-15 paper "Modeling Graph Structure via Relative Position for Text Generation from Kn…☆13Aug 10, 2021Updated 4 years ago
- A WGAN-GP that utilizes a compositional pattern producing network as the generator☆11Sep 9, 2021Updated 4 years ago
- Python implementation of Gibbs sampling for the naïve Bayes model presented by Resnik and Hardisty☆14Feb 10, 2018Updated 8 years ago
- Matlab Code base for T-RO 20 paper on robust Control Barrier Functions (CBFs) with Gaussian Process Regression for estimating the disturb…☆18Nov 9, 2021Updated 4 years ago
- Implementation of stable-baselines3 in rust with burn☆19Nov 24, 2025Updated 2 months ago
- Generating Training Data Made Easy☆43Jul 3, 2020Updated 5 years ago
- ☆11Mar 4, 2021Updated 4 years ago
- Albert for Conversational Question Answering Challenge☆22Jun 12, 2023Updated 2 years ago
- ☆21Jun 13, 2019Updated 6 years ago
- Vector Quantile Regression☆19Apr 3, 2025Updated 10 months ago
- Several ADP algorithms code for the data-driven optimal control of linear time-varying systems☆23Sep 17, 2022Updated 3 years ago
- A general purpose numerical simulator supporting nested dynamical systems and a convenient macro-based data logger.☆20Jan 27, 2025Updated last year
- A lightweight command line interface for the management of arbitrary machine learning tasks☆19Jan 29, 2021Updated 5 years ago
- Repository of SMAI homeworks, Monsoon 2019-20.☆19Dec 7, 2019Updated 6 years ago
- Kervolution implementation using TF2.0☆20Dec 8, 2022Updated 3 years ago
- Reinforcement based gain calculation for a tracking LQR using actor-critic method☆24Mar 27, 2021Updated 4 years ago
- ☆27Jan 8, 2026Updated last month
- PyTorch agents and tools for (Deep) Reinforcement Learning☆25Jan 3, 2025Updated last year
- Andrew Ng's ML course☆22Jul 3, 2018Updated 7 years ago
- Low-Order modelling of Floating offshore wind Turbines/Farms for grid integration research☆18Aug 9, 2025Updated 6 months ago
- ☆11Sep 18, 2025Updated 4 months ago
- ☆38Aug 24, 2024Updated last year
- Source code for examples in Book "Robust Adaptive Dynamic Programming"☆144Jan 2, 2023Updated 3 years ago
- Learning Lyapunov functions and control policies of nonlinear dynamical systems☆144May 3, 2021Updated 4 years ago
- ☆43Oct 19, 2022Updated 3 years ago
- Pytorch Code for S2IGAN☆41Aug 11, 2020Updated 5 years ago
- ☆11Apr 27, 2020Updated 5 years ago
- Functional testing Java-EE applications☆10Sep 26, 2017Updated 8 years ago
- A Library for Scaling Mixed-Integer Optimization-Based Machine Learning.☆12Jun 24, 2024Updated last year
- The NSE has a website that displays the option chain in near real-time. This program retrieves this data from the NSE site and then gener…☆11Aug 21, 2021Updated 4 years ago
- ☆10Apr 4, 2018Updated 7 years ago
- This is my paper.☆10Jun 24, 2023Updated 2 years ago
- Partially Observable Multi-Agent RL with Transformers☆17Updated this week
- A program that performs simple mouse operations via hand gestures using MediaPipe.☆12Mar 17, 2021Updated 4 years ago
- Interactive Brokers TWS API -- "The Big Red Button" - one button to cancel all orders and close all positions.☆11Apr 10, 2018Updated 7 years ago