2048 environment for Reinforcement Learning and DQN algorithm
☆40May 27, 2022Updated 3 years ago
Alternatives and similar repositories for 2048_env
Users that are interested in 2048_env are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Open AI gym environment for the game 2048☆76Updated this week
- [ICLR 2024] DMBP: Diffusion Model-Based Predictor for Robust Offline Reinforcement Learning against State Observations Perturbations.☆17May 24, 2024Updated last year
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆24Feb 15, 2023Updated 3 years ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆28Feb 21, 2022Updated 4 years ago
- testing MLP, DQN, PPO, SAC, policy-gradient by snakeAI☆11May 6, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for PolyTask: Learning Unified Policies through Behavior Distillation☆11Oct 13, 2023Updated 2 years ago
- Source code for the IROS21 paper Efficient Task Planning for Mobile Manipulation: a Virtual Kinematic Chain Perspective☆11Aug 2, 2021Updated 4 years ago
- Privacy-preserving Voice Analysis via Disentangled Representations☆11Aug 30, 2021Updated 4 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆89Jul 9, 2020Updated 5 years ago
- Model-based Hindsight Experience Replay☆10Jun 8, 2022Updated 3 years ago
- 一个简单易用,稳定高效的及时通讯框架(支持端口多开,同时支持socket与websocket消息互通)A simple and easy to use, stable and efficient timely communication framework (support…☆13Sep 19, 2024Updated last year
- ☆29Oct 10, 2018Updated 7 years ago
- ☆25Nov 30, 2020Updated 5 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆15Jan 18, 2026Updated 2 months ago
- A simplistic implementation of DQN that works under CartPole-v0 with rendered pixels as input☆13Feb 28, 2019Updated 7 years ago
- We introduce a way to extend sparse dictionary learning to deep architectures.☆17Jan 13, 2022Updated 4 years ago
- Benchmark dataset for the paper "Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with …☆23May 20, 2025Updated 10 months ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆45Jun 8, 2023Updated 2 years ago
- This repo holds trending techniques for sensor fusion task using Transformers☆14Feb 21, 2023Updated 3 years ago
- Reinforcement Learning (PPO) applied to a multiplayer simple card game (Witches)☆10Jun 7, 2020Updated 5 years ago
- 使用WPF编写的BLE(低功耗蓝牙)应用☆16Jul 29, 2023Updated 2 years ago
- An easy to understand implementation of the paper "Model-Based Reinforcement Learning for Atari"☆17Sep 27, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Refinforcement learning framework☆14Mar 25, 2023Updated 3 years ago
- OpenAI Gym 课程练习笔记☆15Apr 16, 2024Updated last year
- ☆10Nov 23, 2020Updated 5 years ago
- NNUE-like engine of Gomoku game☆13Aug 23, 2025Updated 7 months ago
- Some scripts to turn an OpenWrt router into a passive find3 scanner☆26Oct 11, 2020Updated 5 years ago
- A simple highway traffic simulation for self-driving car agents in occupancy grid world☆16May 28, 2019Updated 6 years ago
- Deep Learning 2021 in School of Data Science, USTC☆12May 17, 2023Updated 2 years ago
- ☆16Mar 4, 2026Updated 3 weeks ago
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models☆28Nov 25, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- Pathfinding Using Reinforcement Learning☆12May 21, 2019Updated 6 years ago
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code☆17Aug 23, 2024Updated last year
- Templates and examples for ACL and EMNLP conference posters.☆14Oct 5, 2024Updated last year
- Implementation of WGAN-QC☆16Nov 25, 2019Updated 6 years ago
- 爬取各大OJ题目☆10Aug 28, 2017Updated 8 years ago
- Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains☆12Oct 16, 2017Updated 8 years ago