☆41Aug 24, 2018Updated 7 years ago
Alternatives and similar repositories for reinforcement-learning-kdd
Users that are interested in reinforcement-learning-kdd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.☆20Apr 3, 2018Updated 8 years ago
- Companion repository for the KDD'18 hands-on tutorial on Higher-Order Data Analytics for Temporal Network Data☆46Mar 15, 2019Updated 7 years ago
- KDD Hands-On Tutorial (2018)☆29Dec 8, 2022Updated 3 years ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- ☆14Jul 14, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Auto Encoder on Tensorflow☆12Oct 18, 2017Updated 8 years ago
- Collection of useful notebooks to be used with the Spark Notebook (https://github.com/andypetrella/spark-notebook)☆22Dec 6, 2018Updated 7 years ago
- Tutorial for PyData London 2019 on AB Test by cluster☆13Jul 12, 2019Updated 6 years ago
- pyrff: Python implementation of random fourier feature approximations for gaussian processes☆29May 4, 2026Updated last month
- A simple tutorial of TensorFlow + TensorFlow / NumPy exercises☆12Feb 17, 2017Updated 9 years ago
- Lightweight coding agent that runs in your terminal☆44Jun 24, 2025Updated 11 months ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆378Nov 19, 2022Updated 3 years ago
- Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ)☆48Jul 4, 2018Updated 7 years ago
- Code for: Murray EJ, Robins JM, George R. Seage III, Freedberg KA, Hernan MA. A Comparison of Agent-Based Models and the Parametric G-For…☆10Nov 27, 2017Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Notes for short course on econometrics in Stan☆13Jun 17, 2017Updated 9 years ago
- Various DQN method with cartpole☆11May 30, 2018Updated 8 years ago
- ☆10Jun 6, 2017Updated 9 years ago
- Discussion for Stan for economists☆10Mar 29, 2016Updated 10 years ago
- ☆27May 17, 2019Updated 7 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆63Sep 5, 2018Updated 7 years ago
- ☆28Jan 2, 2023Updated 3 years ago
- Hypothesis testing (Parametric/Non-Parametric)☆11Oct 8, 2019Updated 6 years ago
- The mlr package online tutorial☆20Jul 20, 2018Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for 'Contrastive Multi-Document Question Generation'☆11Oct 16, 2022Updated 3 years ago
- ☆22Aug 28, 2020Updated 5 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆11May 8, 2018Updated 8 years ago
- ☆17Feb 19, 2018Updated 8 years ago
- Some starter code for training/testing some basic CNN models given our data.☆10Feb 15, 2017Updated 9 years ago
- Logarithmic Reinforcement Learning☆28Apr 7, 2023Updated 3 years ago
- A simple parameterized storylet manager for Twine and Sugarcube☆14Feb 28, 2021Updated 5 years ago
- ☆25Dec 8, 2022Updated 3 years ago
- Scaleable input gradient regularization☆22Jul 8, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11Feb 20, 2017Updated 9 years ago
- Autonomous exploration, active learning and human guidance with open-source Poppy humanoid robot platform and Explauto library☆18May 22, 2018Updated 8 years ago
- Presidential election Monte Carlo simulation in Go based on latest polling from Huffington Post API☆30Oct 30, 2016Updated 9 years ago
- Monitoring Apache Kafka with Prometheus and Grafana☆10Sep 29, 2023Updated 2 years ago
- Idiomatic and reactive Scala client for Aerospike database☆10Dec 28, 2024Updated last year
- Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130☆30Jun 11, 2020Updated 6 years ago
- A talk illustrating some of the Advanced features of PyMC3☆11Dec 19, 2017Updated 8 years ago