Reinforcement learning agents and environment for Easy21, a modified version of Blackjack
☆14May 7, 2017Updated 9 years ago
Alternatives and similar repositories for easy21
Users that are interested in easy21 are comparing it to the libraries listed below.
- solutions to David Silver's RL course project Easy21☆19Jun 28, 2016Updated 10 years ago
- a repository to build selenium solution continuously☆11Jun 13, 2017Updated 9 years ago
- BFAST3D: Bayesian Fast Accurate Spatial Tricks in 3D. For fMRI analysis.☆11Sep 30, 2020Updated 5 years ago
- How to use tensorboard in fastai☆21Jul 10, 2019Updated 6 years ago
- Minimal web frontend for esniper, a lightweight eBay sniping tool☆10Dec 16, 2018Updated 7 years ago
- RDMA-Based RPC☆15Sep 1, 2023Updated 2 years ago
- personalized collection of books☆15Jan 24, 2021Updated 5 years ago
- ☆25Aug 1, 2016Updated 9 years ago
- CoRM: Compactable Remote Memory over RDMA☆20Jun 18, 2021Updated 5 years ago
- https://github.com/mitsuba-renderer/mitsuba2 in docker☆10Jun 13, 2020Updated 6 years ago
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch.☆13Jun 19, 2017Updated 9 years ago
- FNV hash collision generator☆12Mar 2, 2017Updated 9 years ago
- POPGym Library in JAX☆14Apr 15, 2024Updated 2 years ago
- Miscellaneous code for doing NLP with Theano☆13May 16, 2020Updated 6 years ago
- A proxy for reverse engineering a communication protocol☆10Jan 17, 2021Updated 5 years ago
- Exploring the use of options in creating small worlds for faster learning in RL Domains☆16Jan 23, 2012Updated 14 years ago
- ☆14May 27, 2009Updated 17 years ago
- Implementation of Receding Horizon Curiosity Algorithm☆13Mar 24, 2023Updated 3 years ago
- Reinforcement Learning Assignment: Easy21☆12Jul 4, 2016Updated 9 years ago
- Stochastic Machines for Unsupervised Learning implemented in Pytorch.☆10Sep 3, 2017Updated 8 years ago
- Dockerfiles for the reveal.js presentation framework☆13Sep 30, 2019Updated 6 years ago
- Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆15May 10, 2024Updated 2 years ago
- Baremetal Backtracing on RISC-V☆16Jun 22, 2021Updated 5 years ago
- Scaling Up Memory Disaggregated Applications with SMART☆34Apr 23, 2024Updated 2 years ago
- A docker container to run your docker-reveal.js based slideshow without headache.☆13Nov 10, 2016Updated 9 years ago
- This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning. (C51-DDPG)☆11Sep 14, 2017Updated 8 years ago
- Variational Reinforcement Learning☆17Jul 25, 2024Updated last year
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Companion code to the book Blender Add-on Cookbook☆19Mar 18, 2017Updated 9 years ago
- UnrealCV for image rendering from 3D model☆14May 21, 2020Updated 6 years ago
- This is the dataset generation code for ADEPT (Approximate Derenderer, Extended Physics, and Tracking). http://physadept.csail.mit.edu/☆15Sep 26, 2022Updated 3 years ago
- Python wrappers for GTSAM 3☆13May 22, 2017Updated 9 years ago
- A framework for local feature evaluation for Python and MATLAB.☆13Jul 6, 2023Updated 2 years ago
- Hermit: Low-Latency, High-Throughput, and Transparent Remote Memory via Feedback-Directed Asynchrony☆35May 29, 2024Updated 2 years ago
- Code for "Bridging the Gap between f-GANs and Wasserstein GANs", ICML 2020☆14Jul 18, 2020Updated 5 years ago
- Blog post☆17Feb 16, 2024Updated 2 years ago
- MongoDB Browser for results created with the Sacred framework☆16May 22, 2019Updated 7 years ago
- Separating value functions across time-scales.☆18May 13, 2019Updated 7 years ago
- Memory-Based Meta-Learning on Non-Stationary Distributions☆18Mar 14, 2024Updated 2 years ago