A simplistic implementation of DQN that works under CartPole-v0 with rendered pixels as input
☆13 · Feb 28, 2019 · Updated 7 years ago
Alternatives and similar repositories for Simple-DQN-Pytorch
Users who are interested in Simple-DQN-Pytorch are comparing it to the libraries listed below.
- ☆13 · Apr 5, 2023 · Updated 3 years ago
- Calibration of the rigid transformation between the stereo and odometry ☆13 · Jul 24, 2016 · Updated 9 years ago
- ☆27 · Apr 26, 2024 · Updated 2 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data. ☆11 · Nov 3, 2020 · Updated 5 years ago
- ddl_0725 ☆14 · May 23, 2019 · Updated 7 years ago
- PyTorch implementation of DQN ☆13 · Sep 27, 2019 · Updated 6 years ago
- This project visualizes NuScene data, such as point cloud data from radar and lidar and images captured by various sensors ☆11 · Feb 4, 2021 · Updated 5 years ago
- A minimalistic header-only C++11 Neural Network library based on Eigen::Tensor ☆20 · Jan 15, 2018 · Updated 8 years ago
- Recognizing common speech commands using Keras and Tensorflow. ☆10 · Dec 17, 2018 · Updated 7 years ago
- ☆11 · Jan 21, 2021 · Updated 5 years ago
- ☆11 · Oct 19, 2023 · Updated 2 years ago
- URDF description of the JVRC humanoid model ☆15 · Jan 9, 2025 · Updated last year
- Code for CIKM'18 paper, Linked Causal Variational Autoencoder for Inferring Paired Spillover Effects. ☆15 · Jan 15, 2023 · Updated 3 years ago
- ☆10 · Feb 17, 2026 · Updated 3 months ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas. ☆13 · Jul 19, 2024 · Updated last year
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24 (Oral) ☆17 · Nov 14, 2024 · Updated last year
- ☆15 · Jul 4, 2024 · Updated last year
- STRODE: Stochastic Boundary Ordinary Differential Equation ☆13 · Jul 20, 2021 · Updated 4 years ago
- ☆16 · Jun 1, 2023 · Updated 2 years ago
- ICML Workshop 18 - Auto-Classification of Retinal Diseases in the Limit of Sparse Data Using a Two-Streams Machine Learning Model ☆17 · Jun 22, 2020 · Updated 5 years ago
- An implementation of Deepmind's MuZero algorithm. ☆16 · Aug 23, 2021 · Updated 4 years ago
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation ☆29 · Feb 25, 2025 · Updated last year
- Deep Reinforcement Learning Nanodegree program from Udacity ☆10 · Nov 3, 2018 · Updated 7 years ago
- ☆30 · Mar 11, 2026 · Updated 2 months ago
- Arduino and Raspberry Pi Source Code for Bee Hive Temperature Monitoring Project http://beemonitor.org/setup ☆10 · Jun 12, 2016 · Updated 9 years ago
- Temporally Correlated Episodic Reinforcement Learning, ICLR 24 ☆12 · Apr 8, 2024 · Updated 2 years ago
- Code for Estimating Multi-cause Treatment Effects via Single-cause Perturbation (NeurIPS 2021) ☆14 · Jan 5, 2022 · Updated 4 years ago
- my Reinforcement Learning playground ☆10 · Oct 7, 2018 · Updated 7 years ago
- Model Predictive Control of a quadrotor for trajectory tracking. ☆13 · May 8, 2023 · Updated 3 years ago
- Neural networks do line art stylization ☆14 · Dec 30, 2020 · Updated 5 years ago
- openai-gym style RL benchmark for interconnection network congestion control study ☆17 · May 12, 2022 · Updated 4 years ago
- Comparing obstacle avoidance formulations ☆11 · Oct 22, 2022 · Updated 3 years ago
- mjlab reinforcement learning for the BDX-R robot ☆64 · May 18, 2026 · Updated last week
- Polyaxon Clients & Language SDKs ☆14 · May 15, 2026 · Updated last week
- Soccer toy example simulator used in Reinforcement Learning ☆12 · Mar 11, 2018 · Updated 8 years ago
- This repository contains code and config files that accompany the blog post: ☆16 · Aug 30, 2019 · Updated 6 years ago
- A day without reading, and the mind holds no fine thoughts; a month without reading, and the ears and eyes lose their clarity. ☆17 · Jul 24, 2020 · Updated 5 years ago
- Code accompanying the latent-action-priors paper. ☆12 · Mar 5, 2025 · Updated last year
- ☆14 · Nov 16, 2024 · Updated last year