Extending rllab to event-driven multiagent environments
☆13Oct 1, 2018Updated 7 years ago
Alternatives and similar repositories for event-driven-rllab
Users that are interested in event-driven-rllab are comparing it to the libraries listed below
Sorting:
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.☆18Feb 27, 2023Updated 3 years ago
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning☆19Jan 11, 2023Updated 3 years ago
- Learning Robust Dynamics Through Variational Sparse Gating☆20Oct 19, 2022Updated 3 years ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Mar 9, 2021Updated 4 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆28Mar 24, 2023Updated 2 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆28Feb 21, 2022Updated 4 years ago
- DECAF is a tool that measure the performance of cloud gaming platforms such as Google Stadia, Amazon Luna, NVIDIA GeForceNow.☆12Dec 17, 2021Updated 4 years ago
- PyTorch implementation of Episodic Meta Reinforcement Learning on variants of the "Two-Step" task. Reproduces the results found in three …☆37Dec 12, 2020Updated 5 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- ☆37Mar 31, 2020Updated 5 years ago
- Skeleton for scalable and flexible Jax RL implementations☆96Jul 1, 2023Updated 2 years ago
- ☆10Sep 22, 2021Updated 4 years ago
- A Texas Holdem poker framework written in C++ 20.☆11Apr 23, 2023Updated 2 years ago
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆27Oct 16, 2025Updated 4 months ago
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- ☆13May 21, 2024Updated last year
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Dec 8, 2022Updated 3 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Aug 21, 2018Updated 7 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Jan 9, 2018Updated 8 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 2 years ago
- MATLAB implementation of the universal directed information estimators in Jiantao Jiao, Haim H. Permuter, Lei Zhao, Young-Han Kim, and Ts…☆11Apr 2, 2019Updated 6 years ago
- Layered distributions using FLAX/JAX☆10Dec 13, 2020Updated 5 years ago
- ☆13Apr 11, 2022Updated 3 years ago
- A simple multicohort LTV calculator for subscriptions☆11Mar 7, 2023Updated 2 years ago
- Implicit Differentiable Optimal Control (IDOC) with JAX☆12May 11, 2022Updated 3 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆10Oct 21, 2024Updated last year
- Gym wrapper for Vizdoom environments☆12Dec 14, 2018Updated 7 years ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- Model-based time series clustering using variational inference.☆12Oct 28, 2018Updated 7 years ago
- Mis proyectos de marketing aplicando AI☆11Oct 31, 2025Updated 4 months ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago