An attempt at applying deep RL to the board game 2048
☆17Jan 5, 2017Updated 9 years ago
Alternatives and similar repositories for 2048-RL-DRQN
Users that are interested in 2048-RL-DRQN are comparing it to the libraries listed below
- Tensorflow DQN and DRQN agent playing doom☆35May 5, 2017Updated 8 years ago
- Links to common Chinese knowledge graphs☆22May 23, 2017Updated 8 years ago
- Pulsar-plot of Strava runs in Swift 3☆12Apr 2, 2018Updated 7 years ago
- Information-oriented Metric (IOM)☆11Sep 2, 2020Updated 5 years ago
- Code for SemEval-16 Task6 subtaskA and subtaskB.☆10Mar 31, 2016Updated 9 years ago
- A comprehensive framework to explore whether embodied multimodal models are plausibly resilient☆13Nov 19, 2025Updated 3 months ago
- Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)☆40Feb 7, 2019Updated 7 years ago
- Attention is All You Need in Sonnet☆38Aug 30, 2017Updated 8 years ago
- Normalizer for honeypot data.☆11Dec 6, 2023Updated 2 years ago
- Today I Learned☆10Mar 30, 2021Updated 4 years ago
- Temporal Random Indexing☆14Oct 3, 2024Updated last year
- Flatland code for our submission to the NeurIPS 2020 Flatland challenge (round 1 winner, 4th place at round 2).☆11Feb 2, 2021Updated 5 years ago
- Implementations of some outlier detection algorithms☆11Sep 25, 2015Updated 10 years ago
- ☆11Feb 17, 2026Updated 2 weeks ago
- Tree-LSTM + Self-Structured Attention -- a method to summarize textual data by topics☆10Apr 26, 2018Updated 7 years ago
- Arduino and Raspberry Pi Source Code for Bee Hive Temperature Monitoring Project http://beemonitor.org/setup☆10Jun 12, 2016Updated 9 years ago
- Using the PILCO algorithm to find a controller for a few robotic problems☆44Jul 31, 2015Updated 10 years ago
- Reinforcement learning toolbox containing TRPO and A3C algorithms for continuous action spaces☆42Jan 27, 2018Updated 8 years ago
- JobScheduler 1: Master, Agent☆11Jan 27, 2026Updated last month
- opennlp-solr-examples☆10Jul 1, 2022Updated 3 years ago
- ☆14Dec 8, 2022Updated 3 years ago
- Scalable learning with pragmatics☆11Mar 31, 2018Updated 7 years ago
- Semi-supervised Latent Dirichlet Allocation (LDA)☆12Dec 21, 2017Updated 8 years ago
- my Reinforcement Learning playground☆10Oct 7, 2018Updated 7 years ago
- This repo contains my code for Hikeathon contest conducted by Analytics Vidhya☆10Apr 8, 2019Updated 6 years ago
- Implementation of QA Networks☆10Jul 14, 2016Updated 9 years ago
- textual entailment with structural attentions and composition☆12Dec 20, 2016Updated 9 years ago
- Tools for performing hyperparameter search with Scikit-Learn and Dask http://dask-searchcv.readthedocs.io☆11Nov 16, 2017Updated 8 years ago
- A custom extractor designed to read parquet for Azure Data Lake Analytics☆13Feb 13, 2018Updated 8 years ago
- Incremental Learning the Hierarchical Softmax Function for Neural Language Models☆11Dec 6, 2016Updated 9 years ago
- Speaker Role Contextual Model for Dialogues☆15Sep 30, 2017Updated 8 years ago
- Angular (Angular 2, Angular io) components based on "PowerBI-Angular" and "PowerBI-Javascript" to use PowerBI Embedded Features☆10Mar 16, 2018Updated 7 years ago
- ☆14Mar 19, 2021Updated 4 years ago
- Mathematical foundations☆13Feb 8, 2018Updated 8 years ago
- Using Kendo UI for Angular with the Angular QuickStart☆12Nov 15, 2017Updated 8 years ago
- ☆14Sep 25, 2023Updated 2 years ago
- 2016 CCF Big Data and Computational Intelligence Contest: Sogou user profiling☆10Aug 18, 2017Updated 8 years ago
- Deep Reinforcement Learning framework based on TensorFlow and OpenAI Gym☆13Apr 30, 2018Updated 7 years ago
- Little Book of R for Bayesian Statistics☆24Oct 31, 2017Updated 8 years ago