2048 environment for Reinforcement Learning and DQN algorithm
☆40May 27, 2022Updated 4 years ago
Alternatives and similar repositories for 2048_env
Users that are interested in 2048_env are comparing it to the libraries listed below.
- [ICLR 2024] DMBP: Diffusion Model-Based Predictor for Robust Offline Reinforcement Learning against State Observations Perturbations.☆17May 24, 2024Updated 2 years ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆24Feb 15, 2023Updated 3 years ago
- Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.☆17Jun 23, 2021Updated 5 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆27Feb 21, 2022Updated 4 years ago
- Source code for the IROS21 paper Efficient Task Planning for Mobile Manipulation: a Virtual Kinematic Chain Perspective☆11Aug 2, 2021Updated 4 years ago
- Privacy-preserving Voice Analysis via Disentangled Representations☆12Aug 30, 2021Updated 4 years ago
- ☆24Sep 1, 2025Updated 10 months ago
- A simple, easy-to-use, stable, and efficient instant messaging framework (supports opening multiple ports, and supports message interoperability between socket and websocket)☆13Sep 19, 2024Updated last year
- code of IJCAI submission "Soft Hindsight Experience Replay"☆13Mar 23, 2020Updated 6 years ago
- Convolutional Neural Networks☆13Feb 7, 2021Updated 5 years ago
- ☆29Oct 10, 2018Updated 7 years ago
- This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box☆18Dec 19, 2024Updated last year
- source code for AAMAS 2023 Imperfect-information Card Game Competition☆13Mar 21, 2024Updated 2 years ago
- Helps connect to the network at CPU; other schools in a similar situation can modify the code to get it working☆12May 21, 2026Updated last month
- Code for https://arxiv.org/abs/1811.00145☆12Feb 13, 2021Updated 5 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- (中国鱼) Chinese Chess with NNUE fork from stockfish and pikafish☆15Nov 26, 2022Updated 3 years ago
- ☆16Apr 14, 2026Updated 2 months ago
- A simplistic implementation of DQN that works under CartPole-v0 with rendered pixels as input☆13Feb 28, 2019Updated 7 years ago
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆11May 4, 2024Updated 2 years ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆45Jun 8, 2023Updated 3 years ago
- distributed log tail viewer☆18Dec 3, 2015Updated 10 years ago
- Reinforcement Learning (PPO) applied to a multiplayer simple card game (Witches)☆10Jun 7, 2020Updated 6 years ago
- An easy to understand implementation of the paper "Model-Based Reinforcement Learning for Atari"☆18Sep 27, 2019Updated 6 years ago
- This is a Repository for the infinite horizon controller and the preview path tracking controller for Carla-Vehicle assets.☆13Jul 2, 2019Updated 7 years ago
- Reinforcement learning framework☆15Mar 25, 2023Updated 3 years ago
- ☆10Nov 23, 2020Updated 5 years ago
- Implementation for paper SideWindowFilter☆10Nov 28, 2019Updated 6 years ago
- Official implementation of paper "CoIRL-AD: Collaborative and Competitive Imitation–Reinforcement Learning for Autonomous Driving"☆40Mar 30, 2026Updated 3 months ago
- ROS package for the Robotiq 85 Gripper using RS485 communication☆18Feb 22, 2016Updated 10 years ago
- A simple highway traffic simulation for self-driving car agents in occupancy grid world☆16May 28, 2019Updated 7 years ago
- ☆18Updated this week
- Fixed version of tg-cli with support of channels and groups.☆13Jul 7, 2017Updated 8 years ago
- Pathfinding Using Reinforcement Learning☆12May 21, 2019Updated 7 years ago
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code☆17Aug 23, 2024Updated last year
- Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains☆12Oct 16, 2017Updated 8 years ago
- A PROJECT FOR IRONY Usage Lagrange.Core + NapCat.Framework☆12Dec 12, 2024Updated last year
- High TPS Solana client powered by Rakurai.☆16Sep 27, 2024Updated last year