Note: "Deep Reinforcement Learning: An Overview"
☆12Mar 26, 2018Updated 7 years ago
Alternatives and similar repositories for RL_overview_note
Users that are interested in RL_overview_note are comparing it to the libraries listed below.
- A Starcraft 2 proxy reaper AI based on the CommandCenter bot. Inspired by the pro player ByuN, the king of Reaper Micro.☆11Sep 19, 2018Updated 7 years ago
- ☆47Jun 19, 2018Updated 7 years ago
- EMNLP 2022: Leveraging Locality in Abstractive Text Summarization☆18Oct 21, 2024Updated last year
- Slides for the tutorial talk on Bayesian Machine Learning at PyCon 2017☆10May 19, 2017Updated 8 years ago
- Using Facebook's Prophet forecasting tool for a time series study of asteroid impacts on Earth☆11Mar 11, 2018Updated 8 years ago
- A game to show how to organize and maintain a GDG community☆12Jan 2, 2018Updated 8 years ago
- A list of papers for machine learning, reinforcement learning, NLP or something interesting☆13Mar 20, 2021Updated 5 years ago
- Tools for the Parse-27k Dataset - evaluation routines and some simple scripts to get started...☆10Jul 16, 2016Updated 9 years ago
- eXtreme MultiLabel Classification tutorial notebook for Machine Learners (with video)☆13Jan 29, 2018Updated 8 years ago
- Workshop about DVC VSCode Extension☆13Sep 25, 2024Updated last year
- Scripts for exporting Kaldi labeled data into TensorFlow☆12Jul 31, 2019Updated 6 years ago
- Parallel Sobel Operator Using CUDA Programming☆13Apr 12, 2013Updated 12 years ago
- Code & Data for the paper "Learning to Deceive with Attention-Based Explanations"☆18Jan 22, 2021Updated 5 years ago
- This is the repo for the Nylas AI Hackathon☆15Oct 1, 2023Updated 2 years ago
- Instruction to data diversification☆24Nov 24, 2020Updated 5 years ago
- MatConvNet implementation of a person re-identification baseline. We reached Rank@1=87.74%, mAP=69.46% with softmax loss only.☆12Feb 1, 2018Updated 8 years ago
- Comparison between Sarsa and Q-Learning algorithms on risk handling☆17Jul 10, 2017Updated 8 years ago
- Comparison between ShuffleNet V2 and MnasNet (unofficial implementation in PyTorch)☆12Oct 25, 2018Updated 7 years ago
- Add _ as a shorthand in shell mode for the last shell output☆16Aug 30, 2022Updated 3 years ago
- Estimate the frequency and severity of claims to compute prior and posterior premiums. The GLM method is used with Poisson, Negative Bin…☆10Apr 26, 2018Updated 7 years ago
- Jupyter Notebooks for the Python Data Science Handbook☆17Feb 19, 2017Updated 9 years ago
- RL environment replicating the werewolf game to study emergent communication☆20May 25, 2023Updated 2 years ago
- Checks that the structs are created by the Factory☆15Mar 11, 2026Updated last week
- Supporting tools for PyTorch in biology research.☆19Mar 10, 2022Updated 4 years ago
- The GDG Spain official website☆18Feb 5, 2026Updated last month
- ☆27Oct 13, 2022Updated 3 years ago
- Minimalist Operating System designed to implement as much functionality as possible with a budget of 1000 Lines of Code☆12Sep 28, 2016Updated 9 years ago
- ☆10Jun 4, 2024Updated last year
- A simple script for generating Pascal VOC devkit-style annotations for the WIDER faces dataset☆21Dec 14, 2017Updated 8 years ago
- Wrapper around MLForecast for more plug and play forecasting☆10Oct 23, 2023Updated 2 years ago
- Plugin for beancount, the plaintext accounting software.☆15Jun 27, 2025Updated 8 months ago
- Curriculum vitae of Lester James V. Miranda☆10Jan 13, 2026Updated 2 months ago
- "Applying Regularized Schrödinger-Bridge-Based Stochastic Process in Generative Modeling"☆11Aug 16, 2022Updated 3 years ago
- Implementation of backward elimination algorithm used for dimensionality reduction for improving the performance of risk calculation in l…☆12Jul 25, 2018Updated 7 years ago
- Friday Forecasting Talks materials☆11May 24, 2024Updated last year
- Random Pluto notebooks in Julia☆12Oct 23, 2025Updated 5 months ago
- Code and teaching material for the workshops at the RBA and RBNZ☆22Mar 12, 2017Updated 9 years ago
- 📦 Python library providing Two-Piece distributions functionality. It covers the subfamilies: TP Scale, TP Shape, and Double TP.☆12May 16, 2024Updated last year
- Job Scheduling Simulator for Reinforcement Learning Models☆18Apr 24, 2019Updated 6 years ago