Some notes and experiences from David Silver's Reinforcement Learning Course
☆47Jun 24, 2019Updated 7 years ago
Alternatives and similar repositories for D.Silver_RL_Course
Users that are interested in D.Silver_RL_Course are comparing it to the libraries listed below.
- ☆12May 14, 2024Updated 2 years ago
- Stochastic Sequential Action Control for Continuous-Time Belief Space Planning in Julia☆16Jun 22, 2022Updated 4 years ago
- This is the source code of our paper PALT in EMNLP2022.☆12Nov 19, 2022Updated 3 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆12Oct 8, 2021Updated 4 years ago
- NLP Project + pytorch☆10Oct 17, 2020Updated 5 years ago
- A python module designed for agile RL algorithm developing.☆26Jul 11, 2024Updated last year
- Code for recreating the results of our RSS 2020 paper, 'Learning Memory-Based Control for Human-Scale Bipedal Locomotion.'☆10Aug 18, 2022Updated 3 years ago
- This repository contains the scripts used during my participation on CIKM Cup 2016 (see http://cikmcup.org/ and https://competitions.coda…☆11Nov 4, 2016Updated 9 years ago
- Customizable RecSys Simulator for OpenAI Gym☆26Dec 7, 2021Updated 4 years ago
- ☆16Feb 3, 2020Updated 6 years ago
- Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"☆15Aug 30, 2021Updated 4 years ago
- CK workflow, portable packages and other artifacts for the ReQuEST-ASPLOS'18 submission:☆12Jan 16, 2019Updated 7 years ago
- Evaluate the Quality of Critique☆37Jun 1, 2024Updated 2 years ago
- ☆41Feb 12, 2019Updated 7 years ago
- Deep learning chat based on DL4J☆11Mar 31, 2017Updated 9 years ago
- This repo implements GAN-based models for Dialogue Generation (DP-GAN, SeqGAN, and our own proposed DPAC-GAN)☆29Mar 24, 2024Updated 2 years ago
- official repo for `thinking with images through-self-calling`☆25Dec 28, 2025Updated 6 months ago
- Proof recording for Lean 3☆27Sep 30, 2021Updated 4 years ago
- Library for experimenting with state-of-the-art evaluation metrics like UScore☆12May 27, 2023Updated 3 years ago
- Implementation of Differential Learning Rate in Keras☆11Jun 4, 2019Updated 7 years ago
- Code for creating recurrent neural network with rotational dynamics. Model is discussed in detail in "Rotational Dynamics Reduce Interfer…☆17Jul 23, 2020Updated 5 years ago
- This is the code for G2MILP, a deep learning-based mixed-integer linear programming (MILP) instance generator.☆36Oct 3, 2024Updated last year
- Notes for the Reinforcement Learning course by David Silver along with implementation of various algorithms.☆863Mar 31, 2022Updated 4 years ago
- ☆17Feb 21, 2020Updated 6 years ago
- awesome deep learning papers for reinforcement learning☆17Jan 10, 2018Updated 8 years ago
- ☆13Jun 14, 2017Updated 9 years ago
- A simple tool for serving and monitoring nvidia-smi in the browser☆14Mar 25, 2021Updated 5 years ago
- Table logger using Rich☆13Aug 13, 2025Updated 10 months ago
- X-Trainer collaborative arm platform (±0.05 mm) with VR/gamepad teleop data adapters and NVIDIA GPU-accelerated simulation.☆42Mar 27, 2026Updated 3 months ago
- Convolutional Deep Semantic Similarity Model☆20Feb 15, 2023Updated 3 years ago
- This is the companion code for the method reported in the paper "Learning game-theoretic models of multiagent trajectories using implicit…☆12Feb 8, 2021Updated 5 years ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Feb 15, 2024Updated 2 years ago
- JAX implementation of the Mistral 7b v0.1 model☆13Mar 27, 2024Updated 2 years ago
- ☆13Jul 14, 2024Updated last year
- ☆17Feb 20, 2016Updated 10 years ago
- This repository demonstrates the application of our proposed task-free continual learning method on a synthetic experiment.☆13Jun 24, 2019Updated 7 years ago
- This is the codebase for our ICRA 2020 submission, GraphRQI: Classifying Driver Behaviors Using Graph Spectrums.☆13Dec 8, 2019Updated 6 years ago
- Notes and code from studying reinforcement learning☆12Jul 27, 2020Updated 5 years ago
- Persian Instagram datasets for research and practice☆10Nov 1, 2019Updated 6 years ago