Underflow/reinforcement-2048

A reinforcement learning algorithm for the 2048 game

☆20

Alternatives and similar repositories for reinforcement-2048

Users who are interested in reinforcement-2048 are comparing it to the repositories listed below.

krrish94 / DeepLearningResources
[DEPRECATED] My collection of Deep Learning Resources
☆12 · Jul 11, 2016 · Updated 9 years ago
malllabiisc / sictf
Relation Schema Induction using SICTF
☆16 · Sep 20, 2018 · Updated 7 years ago
sjtu-marl / bd_rd_psro
Code for "Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games"
☆24 · Feb 27, 2022 · Updated 4 years ago
aicenter / openspiel_reproductions
Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works
☆18 · Mar 2, 2021 · Updated 5 years ago
diversepsro / diverse_psro
☆22 · May 20, 2021 · Updated 4 years ago
avdmitry / rl_3d
Reinforcement learning in 3D.
☆21 · Mar 29, 2017 · Updated 8 years ago
unixpickle / uno-ai
AI for the game Uno
☆17 · Aug 6, 2019 · Updated 6 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23 · Nov 27, 2019 · Updated 6 years ago
rohinarora / Neural-Networks-Pruning
☆10 · Jul 15, 2020 · Updated 5 years ago
gliese581gg / batch-A3C_tensorflow
Modified TensorFlow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'
☆21 · Dec 15, 2016 · Updated 9 years ago
sail-sg / optim4rl
Optim4RL is a JAX framework for learning to optimize in reinforcement learning.
☆28 · Nov 27, 2024 · Updated last year
quantumiracle / Consistency_Model_For_Reinforcement_Learning
Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning (ICLR'24)
☆26 · Aug 28, 2024 · Updated last year
wangyuhuix / TRGPPO
☆33 · Nov 21, 2022 · Updated 3 years ago
baekrok / DASH-Direction-Aware-SHrinking
☆13 · Dec 13, 2024 · Updated last year
ParallelDots / Gram-Schmidt-PCA
Implementation of the PCA algorithm using a Gram-Schmidt modification of NIPALS
☆10 · Jun 13, 2015 · Updated 10 years ago
zhouyuanmin / dijkstra_wuhan
Wuhan metro route planning based on Dijkstra's algorithm
☆10 · Jul 1, 2022 · Updated 3 years ago
dquail / NonStationaryBandit
Non-stationary bandit for experiments with Reinforcement Learning
☆33 · Mar 24, 2017 · Updated 8 years ago
collinprather / SlateQ
A comparison of Google's SlateQ algorithm with traditional Reinforcement Learning algorithms
☆39 · Dec 27, 2022 · Updated 3 years ago
stathwang / FPMC
Factorized Personalized Markov Chains for Next Basket Recommendation in R and Python
☆13 · Jan 7, 2018 · Updated 8 years ago
ml-feedback-sys / materials-f23
☆10 · Nov 15, 2023 · Updated 2 years ago
hnjia00 / Delayed-Feedback
☆10 · Jul 8, 2021 · Updated 4 years ago
hyz20 / D2Co
Uncovering User Interest from Biased and Noised Watch Time in Video Recommendation. In RecSys '23.
☆11 · Jul 18, 2023 · Updated 2 years ago
flowersteam / EAGER
☆10 · Oct 11, 2022 · Updated 3 years ago
pymc-learn / pymc-learn-book
Book: Practical Probabilistic Machine Learning in Python
☆10 · Apr 3, 2021 · Updated 4 years ago
alexbeutel / FlexiFaCT
Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop.
☆18 · Dec 28, 2017 · Updated 8 years ago
aijunbai / markov-game
Stochastic Markov Games
☆12 · Oct 5, 2017 · Updated 8 years ago
ShibiHe / Model-Free-Episodic-Control
An implementation of the paper "Model-Free Episodic Control"
☆36 · Sep 30, 2019 · Updated 6 years ago
narseo / ril_analyzer
☆21 · Dec 18, 2013 · Updated 12 years ago
DeepMathLLM / DeepMath
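An open-source mathematical large language model project that aims to explore whether large models have mathematical creativity, and what potential they have in frontier mathematical research.
☆17 · May 16, 2025 · Updated 9 months ago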
TerryYiDa / Flight_maddpg
Maddpg_flight code
☆11 · Jul 4, 2018 · Updated 7 years ago
modal-inria / MixtComp
Model-based clustering package for mixed data
☆13 · Jun 16, 2025 · Updated 8 months ago
dwaipayanroy / QE_With_W2V
Query Expansion using word2vec
☆11 · Jul 18, 2019 · Updated 6 years ago
yoch / svmloader
A very fast parser for sparse matrices in libsvm format
☆10 · Nov 13, 2017 · Updated 8 years ago
SafeRL-Lab / Robust-RL-Baselines
Robust Reinforcement Learning Benchmark
☆12 · Sep 22, 2024 · Updated last year
svrijenhoek / RADio
☆11 · Dec 20, 2023 · Updated 2 years ago
CBVRP-ICIP-2017 / CBVRP-ICIP2017
☆10 · Aug 10, 2017 · Updated 8 years ago
olivierjeunen / decision-theory-www-2021
Materials for the "Recommender Systems through the lens of Decision Theory" tutorial delivered at the 30th Web Conference (WWW '21).
☆11 · Apr 13, 2021 · Updated 4 years ago
zhangsi / CisRec
Cis Recommender
☆16 · May 1, 2012 · Updated 13 years ago
shamanez / Variational-Discriminator-Bottleneck-Tensorflow-Implementation
Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow - TensorFlow Im…
☆13 · Feb 2, 2019 · Updated 7 years ago