Reinforcement Learning in Pacman
☆12May 5, 2018Updated 8 years ago
Alternatives and similar repositories for RL_Pacman
Users that are interested in RL_Pacman are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implement some reinforcement learning algorithms, test and visualize on Pacman.☆34Dec 3, 2018Updated 7 years ago
- Implementation of several multiagent trajectory generation algorithms☆12Jul 21, 2020Updated 5 years ago
- Polish stopwords collection☆16Mar 5, 2020Updated 6 years ago
- Uses gpt-2 to find all completions of a sentence over a certain probability threshold.☆13Mar 17, 2020Updated 6 years ago
- Hands-On Reinforcement Learning with TensorFlow & TRFL☆14Jan 18, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Accompanies the paper "Learnability and Semantic Universals" ; trains recurrent neural networks to learn to verify sentences with quantif…☆11Aug 10, 2019Updated 6 years ago
- ☆11Aug 13, 2020Updated 5 years ago
- SPM Cluster Size Threshold estimation☆13Feb 6, 2023Updated 3 years ago
- By fine tuning GPT2 on News Aggregator data☆15Jan 24, 2021Updated 5 years ago
- Evaluation of Sentence Representations in Polish☆23Dec 29, 2022Updated 3 years ago
- A very simple python stemmer for Polish language based on Porter's Algorithm☆20Dec 4, 2017Updated 8 years ago
- Matlab scripts that extract single subject grey matter networks from grey matter segmented T1 weighted images☆17Jun 4, 2020Updated 6 years ago
- 开箱即用的海康机器人相机工具包☆13Aug 4, 2022Updated 3 years ago
- Dockerfile that is used for the JModelica regression testing of the Buildings library and of BuildingsPy☆16Nov 22, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Dec 26, 2022Updated 3 years ago
- NCSU CSC-326 Course Page☆12Dec 5, 2018Updated 7 years ago
- ☆20Sep 14, 2019Updated 6 years ago
- HUST-CS-2019 操作系统课程及其实验☆12Oct 26, 2022Updated 3 years ago
- The goal of this repository is to detect the outliers for a dataset & see the impact of these outliers on predictive models☆22Jun 1, 2018Updated 8 years ago
- Assignments for CS294-112.☆16Jul 13, 2018Updated 7 years ago
- Here, we compare Q(\sigma) learning presented by Sutton and Barto in [1] to Tree-Backup, n-step Expected Sarsa, and n-step Sarsa.☆15Feb 17, 2017Updated 9 years ago
- fedex-commercial-invoice☆21Apr 28, 2016Updated 10 years ago
- PlaneModel environment for RL☆25Sep 24, 2025Updated 8 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- AutoML Two-Sample Test☆19Aug 3, 2022Updated 3 years ago
- Robomaster Simulator☆12Nov 27, 2024Updated last year
- On the pitfalls of measuring emergent communication☆34Mar 12, 2019Updated 7 years ago
- Scientific Data Format☆23Jul 10, 2017Updated 8 years ago
- Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"☆27Dec 6, 2020Updated 5 years ago
- Data set and source code used in "Emotion Recognition Using Smart Watch Sensor Data: Mixed-Design Study."☆30Jul 6, 2023Updated 2 years ago
- PyTorch implementation of "Recurrent Convolutional Neural Network for Text Classification"☆16Oct 20, 2020Updated 5 years ago
- Python port of Stempel, an algorithmic stemmer for Polish language.☆39Aug 29, 2024Updated last year
- Tensorflow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆24Apr 20, 2017Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.☆29Oct 18, 2017Updated 8 years ago
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"☆43Oct 5, 2022Updated 3 years ago
- A list of all the freely available datasets of energy variables (electricity demand, wind/solar/hydro-power) reconstructions based on cli…☆32Feb 7, 2022Updated 4 years ago
- Implementation of the Option-Critic Architecture☆42Dec 9, 2018Updated 7 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Jun 5, 2019Updated 7 years ago
- Active Learning in R☆47May 21, 2017Updated 9 years ago
- A simple building energy model written in Python.☆31Feb 14, 2022Updated 4 years ago