2048 environment for Reinforcement Learning and DQN algorithm
☆40May 27, 2022Updated 4 years ago
Alternatives and similar repositories for 2048_env
Users that are interested in 2048_env are comparing it to the libraries listed below.
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆27Feb 21, 2022Updated 4 years ago
- testing MLP, DQN, PPO, SAC, policy-gradient by snakeAI☆11May 6, 2025Updated last year
- Source code for the IROS21 paper Efficient Task Planning for Mobile Manipulation: a Virtual Kinematic Chain Perspective☆11Aug 2, 2021Updated 4 years ago
- Privacy-preserving Voice Analysis via Disentangled Representations☆12Aug 30, 2021Updated 4 years ago
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- SVIP: Towards Verifiable Inference of Open-Source Large Language Models☆15Jun 3, 2025Updated last year
- code of IJCAI submission "Soft Hindsight Experience Replay"☆13Mar 23, 2020Updated 6 years ago
- Convolutional Neural Networks☆13Feb 7, 2021Updated 5 years ago
- ☆19Jan 2, 2024Updated 2 years ago
- ☆29Oct 10, 2018Updated 7 years ago
- ☆25Nov 30, 2020Updated 5 years ago
- source code for AAMAS 2023 Imperfect-information Card Game Competition☆13Mar 21, 2024Updated 2 years ago
- Code for https://arxiv.org/abs/1811.00145☆12Feb 13, 2021Updated 5 years ago
- Implementation of DyMA-CL, MARL algorithm☆30Apr 18, 2020Updated 6 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- A simplistic implementation of DQN that works under CartPole-v0 with rendered pixels as input☆13Feb 28, 2019Updated 7 years ago
- We introduce a way to extend sparse dictionary learning to deep architectures.☆17Jan 13, 2022Updated 4 years ago
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆11May 4, 2024Updated 2 years ago
- Benchmark dataset for the paper "Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with …☆27May 20, 2025Updated last year
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆45Jun 8, 2023Updated 3 years ago
- A BLE (Bluetooth Low Energy) application written with WPF☆16Jul 29, 2023Updated 2 years ago
- An easy to understand implementation of the paper "Model-Based Reinforcement Learning for Atari"☆18Sep 27, 2019Updated 6 years ago
- This is a Repository for the infinite horizon controller and the preview path tracking controller for Carla-Vehicle assets.☆13Jul 2, 2019Updated 6 years ago
- ☆18Jan 3, 2022Updated 4 years ago
- Language independent SSL-based Speaker Anonymization system☆20May 28, 2024Updated 2 years ago
- ☆10Nov 23, 2020Updated 5 years ago
- Code for the ACM MM paper "Backdoor Attack on Crowd Counting"☆17Jul 10, 2022Updated 3 years ago
- Implementation for paper SideWindowFilter☆10Nov 28, 2019Updated 6 years ago
- KiCAD plugin written in Python for programmatically placing clusters of components onto a PCB from a layout file.☆10Jun 30, 2021Updated 4 years ago
- [WWW '24] UnifiedSSR: A Unified Framework of Sequential Search and Recommendation☆12Feb 16, 2024Updated 2 years ago
- Official implementation of paper "CoIRL-AD: Collaborative and Competitive Imitation–Reinforcement Learning for Autonomous Driving"☆40Mar 30, 2026Updated 2 months ago
- A simple highway traffic simulation for self-driving car agents in occupancy grid world☆16May 28, 2019Updated 7 years ago
- OpenAI Gym course exercise notes☆15Apr 16, 2024Updated 2 years ago
- Deep Learning 2021 in School of Data Science, USTC☆12May 17, 2023Updated 3 years ago
- [ICLR 2024 Spotlight] Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"☆22Nov 25, 2024Updated last year
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 9 years ago
- Reinforcement learning - Batched Impala - PyTorch - Mario Kart☆13Jul 21, 2020Updated 5 years ago
- Pathfinding Using Reinforcement Learning☆12May 21, 2019Updated 7 years ago
- Templates and examples for ACL and EMNLP conference posters.☆14Oct 5, 2024Updated last year