A reinforcement learning algorithm for the 2048 game
☆20Mar 25, 2014Updated 11 years ago
Alternatives and similar repositories for reinforcement-2048
Users that are interested in reinforcement-2048 are comparing it to the libraries listed below
Sorting:
- [DEPRECATED] My collection of Deep Learning Resources☆12Jul 11, 2016Updated 9 years ago
- Relation Schema Induction using SICTF☆16Sep 20, 2018Updated 7 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆24Feb 27, 2022Updated 4 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- ☆22May 20, 2021Updated 4 years ago
- Reinforcement learning in 3D.☆21Mar 29, 2017Updated 8 years ago
- AI for the game Uno☆17Aug 6, 2019Updated 6 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- ☆10Jul 15, 2020Updated 5 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆21Dec 15, 2016Updated 9 years ago
- Optim4RL is a Jax framework of learning to optimize for reinforcement learning.☆28Nov 27, 2024Updated last year
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆26Aug 28, 2024Updated last year
- ☆33Nov 21, 2022Updated 3 years ago
- ☆13Dec 13, 2024Updated last year
- Implementation of PCA algorithm using Gram-Scmidt modification on NIPALS☆10Jun 13, 2015Updated 10 years ago
- 基于Dijkstra算法的武汉地铁路径规划☆10Jul 1, 2022Updated 3 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆33Mar 24, 2017Updated 8 years ago
- A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms☆39Dec 27, 2022Updated 3 years ago
- Factoried Personalized Markov Chains for Next Basket Recommendation in R and Python☆13Jan 7, 2018Updated 8 years ago
- ☆10Nov 15, 2023Updated 2 years ago
- ☆10Jul 8, 2021Updated 4 years ago
- Uncovering User Interest from Biased and Noised Watch Time in Video Recommendation. In Recsys23.☆11Jul 18, 2023Updated 2 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Book: Practical Probabilistic Machine Learning in Python☆10Apr 3, 2021Updated 4 years ago
- Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop.☆18Dec 28, 2017Updated 8 years ago
- Stochastic Markov Games☆12Oct 5, 2017Updated 8 years ago
- This is the implementation of paper Model Free Episodic Control☆36Sep 30, 2019Updated 6 years ago
- ☆21Dec 18, 2013Updated 12 years ago
- 一个开源数学大模型项目,旨在探索大模型是否具有数学创造能力,以及大模型在前沿数学研究中的潜在能力。☆17May 16, 2025Updated 9 months ago
- Maddpg_flight code☆11Jul 4, 2018Updated 7 years ago
- Model-based clustering package for mixed data☆13Jun 16, 2025Updated 8 months ago
- Query Expansion using word2vec☆11Jul 18, 2019Updated 6 years ago
- a very fast parser for sparse matrix at libsvm format☆10Nov 13, 2017Updated 8 years ago
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- ☆11Dec 20, 2023Updated 2 years ago
- ☆10Aug 10, 2017Updated 8 years ago
- Materials for the "Recommender Systems through the lens of Decision Theory" tutorial delivered at the 30th Web Conference (WWW '21).☆11Apr 13, 2021Updated 4 years ago
- Cis Recommender☆16May 1, 2012Updated 13 years ago
- Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow - Tensorlfow Im…☆13Feb 2, 2019Updated 7 years ago