Value & Policy Iteration for the frozenlake environment of OpenAI
☆15May 14, 2019Updated 6 years ago
Alternatives and similar repositories for frozenlake
Users that are interested in frozenlake are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tensorflow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆25Apr 20, 2017Updated 8 years ago
- Lime: Explaining the predictions of any machine learning classifier☆16May 27, 2019Updated 6 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Deep Q-Network (DQN) to play classic Atari Games☆11Sep 18, 2017Updated 8 years ago
- Bayesian FlowNetS in Tensorflow☆21Dec 20, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Text generation for the Shakespeare model☆13Apr 26, 2017Updated 8 years ago
- This is the code for "Iphone XS Supply Chain" By Siraj Raval on Youtube☆19Sep 18, 2018Updated 7 years ago
- Notebook from my blog☆15Apr 9, 2017Updated 9 years ago
- ☆15May 31, 2017Updated 8 years ago
- 의사결정(DP) + 강화학습(RL) + 온라인광고(OA) + 파이썬웹(Pyweb)☆10Nov 30, 2016Updated 9 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 6 months ago
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- 统计微信朋友圈送出的赞票与得到的赞票人员比例☆11May 3, 2016Updated 9 years ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆40Dec 22, 2025Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- Generative Adversarial Network☆15Oct 12, 2018Updated 7 years ago
- Neo4j 大规模 三元组 CVS 导入进数据库☆11Jul 31, 2020Updated 5 years ago
- official implementation of RoSAS: Deep Semi-supervised Anomaly Detection with Contamination-resilient Continuous Supervision☆12Jul 18, 2023Updated 2 years ago
- NLP tool for optimizing a resume for a job description, computing similarity, and extracting skills☆16Jun 7, 2017Updated 8 years ago
- Analyze a real-time IPv4 packet stream and export metrics about the data flows☆14Jan 29, 2020Updated 6 years ago
- A visualization system for RoboCup@Home robots☆10Jul 12, 2019Updated 6 years ago
- This is the code for "Internet of Things Optimization" By Siraj Raval on Youtube☆30Sep 24, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Basic PyTorch Implementation of 'Neural Architecture Search with Reinforcement Learning' (https://arxiv.org/abs/1611.01578)☆13Feb 24, 2018Updated 8 years ago
- Collection of presentation of my work on various platforms and meetups☆22Feb 2, 2026Updated 2 months ago
- Quick-Data-Science-Experiments☆19Dec 12, 2017Updated 8 years ago
- Artificial intelligence for Jetson RaceCar☆15Sep 1, 2017Updated 8 years ago
- A repo to design basic Policy Gradient labs☆12Jul 6, 2023Updated 2 years ago
- Recommendation system for music.☆15Mar 12, 2023Updated 3 years ago
- Experiment with "one-shot learning" techniques to recognize a voice signature☆24Mar 29, 2020Updated 6 years ago
- Wrapper for the wit.ai natural language API☆10Apr 10, 2018Updated 8 years ago
- Face Recognition Using Siamese Neural Networks based on paper Koch et al., "Siamese Neural Networks for One-shot Image Recognition"☆16May 29, 2017Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Make sense of deep neural networks using TensorBoard☆20May 9, 2017Updated 8 years ago
- Source code for paper Mroueh, Sercu, Rigotti, Padhi, dos Santos, "Sobolev Independence Criterion", NeurIPS 2019☆14Jun 17, 2024Updated last year
- Multi Stopwatch for Python☆12Sep 28, 2019Updated 6 years ago
- This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).☆19Feb 17, 2025Updated last year
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago
- Python server for NAO Communication project☆11Aug 22, 2018Updated 7 years ago
- Working area for the Jetson RACECAR Project☆11Jan 25, 2016Updated 10 years ago