Underflow / reinforcement-2048View external linksLinks
A reinforcement learning algorithm for the 2048 game
☆20Mar 25, 2014Updated 11 years ago
Alternatives and similar repositories for reinforcement-2048
Users that are interested in reinforcement-2048 are comparing it to the libraries listed below
Sorting:
- [DEPRECATED] My collection of Deep Learning Resources☆12Jul 11, 2016Updated 9 years ago
- Relation Schema Induction using SICTF☆16Sep 20, 2018Updated 7 years ago
- ☆22May 20, 2021Updated 4 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆24Feb 27, 2022Updated 3 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 4 years ago
- AI for the game Uno☆17Aug 6, 2019Updated 6 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- ☆10Jul 15, 2020Updated 5 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆21Dec 15, 2016Updated 9 years ago
- Optim4RL is a Jax framework of learning to optimize for reinforcement learning.☆28Nov 27, 2024Updated last year
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆26Aug 28, 2024Updated last year
- ☆33Nov 21, 2022Updated 3 years ago
- ☆13Dec 13, 2024Updated last year
- Implementation of PCA algorithm using Gram-Scmidt modification on NIPALS☆10Jun 13, 2015Updated 10 years ago
- 基于Dijkstra算法的武汉地铁路径规划☆10Jul 1, 2022Updated 3 years ago
- ☆10Nov 15, 2023Updated 2 years ago
- Stochastic Markov Games☆12Oct 5, 2017Updated 8 years ago
- A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms☆39Dec 27, 2022Updated 3 years ago
- Book: Practical Probabilistic Machine Learning in Python☆10Apr 3, 2021Updated 4 years ago
- Uncovering User Interest from Biased and Noised Watch Time in Video Recommendation. In Recsys23.☆11Jul 18, 2023Updated 2 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Factoried Personalized Markov Chains for Next Basket Recommendation in R and Python☆13Jan 7, 2018Updated 8 years ago
- ☆10Jul 8, 2021Updated 4 years ago
- Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop.☆18Dec 28, 2017Updated 8 years ago
- This is the implementation of paper Model Free Episodic Control☆36Sep 30, 2019Updated 6 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆10Oct 21, 2024Updated last year
- Deep Learning Tutorial notes and code. See the wiki for more info.☆10Oct 29, 2015Updated 10 years ago
- ☆10Jul 5, 2016Updated 9 years ago
- ☆13Oct 25, 2019Updated 6 years ago
- ☆11Oct 14, 2021Updated 4 years ago
- 一个开源数学大模型项目,旨在探索大模型是否具有数学创造能力,以及大模型在前沿数学研究中的 潜在能力。☆17May 16, 2025Updated 8 months ago
- ☆11May 26, 2020Updated 5 years ago
- Materials for the "Recommender Systems through the lens of Decision Theory" tutorial delivered at the 30th Web Conference (WWW '21).☆11Apr 13, 2021Updated 4 years ago
- Running TaintDroid from the command line and analyze output☆17Mar 28, 2012Updated 13 years ago
- Maddpg_flight code☆11Jul 4, 2018Updated 7 years ago
- A2C, ACKTR and A2T implementations for ViZDoom☆10Dec 18, 2017Updated 8 years ago
- ☆12Mar 6, 2023Updated 2 years ago
- Fair Benchmarks☆10Mar 14, 2019Updated 6 years ago
- NFBIA Summer School 2015 Deep Learning in Medical Image Analysis☆10Sep 15, 2015Updated 10 years ago