AndreaTirinzoni / iw-transfer-rlView external linksLinks
Code for the paper "Importance Weighted Transfer of Samples in Reinforcement Learning" (ICML 2018).
☆16May 29, 2018Updated 7 years ago
Alternatives and similar repositories for iw-transfer-rl
Users that are interested in iw-transfer-rl are comparing it to the libraries listed below
Sorting:
- Formula Student Technion Driverless - Implementation☆25Jul 30, 2019Updated 6 years ago
- Code for Environment Probing Interaction Policies [ICLR 2019]☆29Jun 17, 2019Updated 6 years ago
- [SIGKDD' 24] PyTorch implementation of Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks☆13Jul 28, 2024Updated last year
- Deep Reinforcement Learning based Autonomous Driving Agents☆10Jul 7, 2022Updated 3 years ago
- Gym implementation of connector to Deepmind lab☆12Mar 26, 2019Updated 6 years ago
- Model-based time series clustering using variational inference.☆12Oct 28, 2018Updated 7 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Jan 9, 2018Updated 8 years ago
- Genetic algorithm for reducing the power loss in an electrical network consisting out of 119 nodes.☆12May 5, 2017Updated 8 years ago
- A simple multicohort LTV calculator for subscriptions☆11Mar 7, 2023Updated 2 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆10Oct 21, 2024Updated last year
- Gym wrapper for Vizdoom environments☆12Dec 14, 2018Updated 7 years ago
- 🎮 A configurable Breakout environment for reinforcement learning☆11Mar 20, 2018Updated 7 years ago
- Multi-Objective Causal Bayesian Optimisation, a new paradigm for finding Pareto-optimal interventions in multi-outcome causal models☆17Jun 2, 2025Updated 8 months ago
- JAX/Haiku implementation of "Auction Learning as a Two-Player Game"☆11Jul 6, 2024Updated last year
- Layered distributions using FLAX/JAX☆10Dec 13, 2020Updated 5 years ago
- yet another reinforcement learning package☆12May 24, 2022Updated 3 years ago
- A library to create lore plots (logistic regression of the prevalence of a categorical variable in function of a continuous feature)☆16Feb 1, 2026Updated 2 weeks ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- Meta Reinforcement Learning Experiments☆35Aug 22, 2017Updated 8 years ago
- Implicit Differentiable Optimal Control (IDOC) with JAX☆12May 11, 2022Updated 3 years ago
- MATLAB implementation of the universal directed information estimators in Jiantao Jiao, Haim H. Permuter, Lei Zhao, Young-Han Kim, and Ts…☆11Apr 2, 2019Updated 6 years ago
- Implements the "Trending Value" strategy introduced by James O'Shaughnessey in his book, "What Works on Wall Street"☆12Jun 16, 2022Updated 3 years ago
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- Mis proyectos de marketing aplicando AI☆11Oct 31, 2025Updated 3 months ago
- Distributed Priortized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- ☆10May 17, 2021Updated 4 years ago
- Variational Information Maximization for Feature Selection☆11Aug 24, 2016Updated 9 years ago
- Command line wrapper to run `uv publish` using default credentials from `~/.pypirc`☆14Feb 7, 2026Updated last week
- ☆11Nov 14, 2022Updated 3 years ago
- ☆12May 10, 2018Updated 7 years ago
- Adaptable generative prediction using recursive least square algorithm☆15Apr 23, 2019Updated 6 years ago
- Datasets for Online Controlled Experiments☆12Apr 4, 2025Updated 10 months ago
- Code for implemeting a conditional DDPM trained on CIFAR10☆12Jan 15, 2024Updated 2 years ago
- Code for https://arxiv.org/abs/1811.00145☆12Feb 13, 2021Updated 5 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- ☆15May 15, 2021Updated 4 years ago
- ☆10Dec 3, 2020Updated 5 years ago
- MuJoCo Models for Personal Robot 2 (PR2)☆11Aug 25, 2018Updated 7 years ago
- ☆12Aug 13, 2022Updated 3 years ago