2048 environment for Reinforcement Learning and DQN algorithm
☆40May 27, 2022Updated 3 years ago
Alternatives and similar repositories for 2048_env
Users that are interested in 2048_env are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.☆17Jun 23, 2021Updated 4 years ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆28Feb 21, 2022Updated 4 years ago
- testing MLP, DQN, PPO, SAC, policy-gradient by snakeAI☆11May 6, 2025Updated 11 months ago
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- Implement many Sparse Reward algorithms in Gym Fetch environment☆89Jul 9, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Model-based Hindsight Experience Replay☆10Jun 8, 2022Updated 3 years ago
- This repository is the accompanying code for the paper CFVFP. This paper presents a new algorithm for solving incomplete information game…☆14Feb 23, 2025Updated last year
- ☆29Oct 10, 2018Updated 7 years ago
- ☆25Nov 30, 2020Updated 5 years ago
- This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box☆18Dec 19, 2024Updated last year
- Code for https://arxiv.org/abs/1811.00145☆12Feb 13, 2021Updated 5 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- Code for "Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent", IJCAI 2024 (Oral)☆16Aug 27, 2024Updated last year
- ☆16Apr 14, 2026Updated 2 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆11May 4, 2024Updated last year
- Benchmark dataset for the paper "Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with …☆26May 20, 2025Updated 11 months ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆45Jun 8, 2023Updated 2 years ago
- An easy to understand implementation of the paper "Model-Based Reinforcement Learning for Atari"☆18Sep 27, 2019Updated 6 years ago
- This is a Repository for the infinite horizon controller and the preview path tracking controller for Carla-Vehicle assets.☆13Jul 2, 2019Updated 6 years ago
- ☆18Jan 3, 2022Updated 4 years ago
- Refinforcement learning framework☆14Mar 25, 2023Updated 3 years ago
- Language independent SSL-based Speaker Anonymization system☆20May 28, 2024Updated last year
- ☆10Nov 23, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Implementation for paper SideWindowFilter☆10Nov 28, 2019Updated 6 years ago
- KiCAD plugin written in Python for programatically placing clusters of components onto a PCB from a layout file.☆10Jun 30, 2021Updated 4 years ago
- Some scripts to turn an OpenWrt router into a passive find3 scanner☆26Oct 11, 2020Updated 5 years ago
- [WWW '24] UnifiedSSR: A Unified Framework of Sequential Search and Recommendation☆12Feb 16, 2024Updated 2 years ago
- A simple highway traffic simulation for self-driving car agents in occupancy grid world☆16May 28, 2019Updated 6 years ago
- OpenAI Gym 课程练习笔记☆15Apr 16, 2024Updated 2 years ago
- [ICLR 2024 Spotlight] Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"☆22Nov 25, 2024Updated last year
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- Reinforcement learning - Batched Impala - PyTorch - Mario Kart☆13Jul 21, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models☆30Nov 25, 2024Updated last year
- Fixed version of tg-cli with support of channels and groups.☆13Jul 7, 2017Updated 8 years ago
- collecting publicly available distillation datasets based on DepSeek-R1☆27Mar 12, 2025Updated last year
- Pathfinding Using Reinforcement Learning☆12May 21, 2019Updated 6 years ago
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code☆17Aug 23, 2024Updated last year
- Templates and examples for ACL and EMNLP conference posters.☆14Oct 5, 2024Updated last year
- Implementation of WGAN-QC☆16Nov 25, 2019Updated 6 years ago