Reinforcement Learning in Pacman
☆12May 5, 2018Updated 8 years ago
Alternatives and similar repositories for RL_Pacman
Users that are interested in RL_Pacman are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Monte Carlo Tree Search Agent used to control agents in a Pacman competition.☆16Jan 30, 2015Updated 11 years ago
- Implementation of several multiagent trajectory generation algorithms☆12Jul 21, 2020Updated 5 years ago
- Polish stopwords collection☆15Mar 5, 2020Updated 6 years ago
- Uses gpt-2 to find all completions of a sentence over a certain probability threshold.☆13Mar 17, 2020Updated 6 years ago
- Hands-On Reinforcement Learning with TensorFlow & TRFL☆14Jan 18, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Accompanies the paper "Learnability and Semantic Universals" ; trains recurrent neural networks to learn to verify sentences with quantif…☆11Aug 10, 2019Updated 6 years ago
- ☆11Aug 13, 2020Updated 5 years ago
- By fine tuning GPT2 on News Aggregator data☆15Jan 24, 2021Updated 5 years ago
- A bottom-up model for the simulation of heat demand profiles of urban areas☆13Dec 11, 2023Updated 2 years ago
- A very simple python stemmer for Polish language based on Porter's Algorithm☆20Dec 4, 2017Updated 8 years ago
- Matlab scripts that extract single subject grey matter networks from grey matter segmented T1 weighted images☆17Jun 4, 2020Updated 5 years ago
- Dockerfile that is used for the JModelica regression testing of the Buildings library and of BuildingsPy☆16Nov 22, 2023Updated 2 years ago
- NCSU CSC-326 Course Page☆12Dec 5, 2018Updated 7 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Nov 14, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆20Sep 14, 2019Updated 6 years ago
- Reinforcement Learning approaches for learning communication in Multi Agent Systems.☆18Jan 27, 2019Updated 7 years ago
- Code for the paper "Collaborative Graph Walk for Semi-supervised Multi-Label Node Classification" - ICDM 2019☆13Mar 25, 2023Updated 3 years ago
- Here, we compare Q(\sigma) learning presented by Sutton and Barto in [1] to Tree-Backup, n-step Expected Sarsa, and n-step Sarsa.☆15Feb 17, 2017Updated 9 years ago
- Calculate phase-amplitude coupling in Python (and Matlab).☆27Sep 5, 2017Updated 8 years ago
- On the pitfalls of measuring emergent communication☆34Mar 12, 2019Updated 7 years ago
- Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"☆27Dec 6, 2020Updated 5 years ago
- Data set and source code used in "Emotion Recognition Using Smart Watch Sensor Data: Mixed-Design Study."☆30Jul 6, 2023Updated 2 years ago
- Python port of Stempel, an algorithmic stemmer for Polish language.☆39Aug 29, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Tensorflow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆25Apr 20, 2017Updated 9 years ago
- An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.☆29Oct 18, 2017Updated 8 years ago
- A list of all the freely available datasets of energy variables (electricity demand, wind/solar/hydro-power) reconstructions based on cli…☆31Feb 7, 2022Updated 4 years ago
- Implementation of the Option-Critic Architecture☆42Dec 9, 2018Updated 7 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Jun 5, 2019Updated 6 years ago
- Active Learning in R☆47May 21, 2017Updated 8 years ago
- A simple building energy model written in Python.☆31Feb 14, 2022Updated 4 years ago
- A Python Implementation of the N4SID algorithm☆31Jun 28, 2020Updated 5 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆33Mar 24, 2017Updated 9 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Solving openAI's game 'BipedalWalker-v2' with Deep Reinforcement Learning☆27May 26, 2020Updated 5 years ago
- A repository of Psychrometric calculation and plotting tools☆42Oct 22, 2024Updated last year
- Implementation of proximal policy optimization(PPO) with tensorflow☆35Feb 10, 2018Updated 8 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Jul 4, 2018Updated 7 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆39Jun 16, 2019Updated 6 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆74Jun 1, 2017Updated 8 years ago
- A collection of oemof examples and notebooks.☆49Jul 3, 2022Updated 3 years ago