A simplistic implementation of DQN that works under CartPole-v0 with rendered pixels as input
☆13Feb 28, 2019Updated 7 years ago
Alternatives and similar repositories for Simple-DQN-Pytorch
Users that are interested in Simple-DQN-Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Apr 5, 2023Updated 2 years ago
- ☆24Jan 26, 2024Updated 2 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- This project enables to visualize NuScene data such as Point Cloud data for Radar, Lidar and Images captures using various sensors☆11Feb 4, 2021Updated 5 years ago
- (Personal project) Pruning algorithm for DNNs using "lottery ticket" pruning☆10Dec 8, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- TensorFlow implementation of Deep Reinforcement Learning papers☆28Dec 31, 2016Updated 9 years ago
- Recognizing common speech commands using Keras and Tensorflow.☆10Dec 17, 2018Updated 7 years ago
- ☆11Jan 21, 2021Updated 5 years ago
- ☆11Oct 19, 2023Updated 2 years ago
- A tool for calling (and calling out to) large language models.☆16Aug 13, 2024Updated last year
- URDF description of the JVRC humanoid model☆15Jan 9, 2025Updated last year
- Code for CIKM'18 paper, Linked Causal Variational Autoencoder for Inferring Paired Spillover Effects.☆15Jan 15, 2023Updated 3 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)☆17Nov 14, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- STRODE: Stochastic Boundary Ordinary Differential Equation☆13Jul 20, 2021Updated 4 years ago
- Complex YOLO with Uncertainty☆19Dec 27, 2018Updated 7 years ago
- ☆15Jun 1, 2023Updated 2 years ago
- ICML Workshop 18 - Auto-Classification of Retinal Diseases in the Limit of Sparse Data Using a Two-Streams Machine Learning Model☆17Jun 22, 2020Updated 5 years ago
- ☆17May 5, 2024Updated last year
- Code and data from the paper "Targeted Nonlinear Adversarial Perturbations in Images and Videos".☆11Sep 8, 2018Updated 7 years ago
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation☆28Feb 25, 2025Updated last year
- Deep Reinforcement Learning Nanodegree program from Udacity☆10Nov 3, 2018Updated 7 years ago
- Tool to bridge Blender animation and physics-based robotic simulation☆17Feb 27, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆16Jun 13, 2022Updated 3 years ago
- ☆14Dec 23, 2020Updated 5 years ago
- Official implementation of the paper "RaceMOP: Mapless Online Path Planning for Multi-Agent Autonomous Racing using Residual Policy Learn…☆10Oct 23, 2024Updated last year
- Temporally Correlated Episodic Reinforcement Learning, ICLR 24☆12Apr 8, 2024Updated last year
- Code repo for "S3R-Net: A Single-Stage Approach to Self-Supervised Shadow Removal" (NTIRE workshop @ CVPR 2024)☆11Jun 15, 2024Updated last year
- Code for Estimating Multi-cause Treatment Effects via Single-cause Perturbation (NeurIPS 2021)☆14Jan 5, 2022Updated 4 years ago
- Model Predictive Control of a quadrotor for trajectory tracking.☆13May 8, 2023Updated 2 years ago
- Comparing obstacle avoidance formulations☆10Oct 22, 2022Updated 3 years ago
- Image Caption, Show and Tell.☆21May 25, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Soccer toy example simulator used in Reinforcement Learning☆12Mar 11, 2018Updated 8 years ago
- Code accompanying the latent-action-priors paper.☆12Mar 5, 2025Updated last year
- 一日不读书,胸臆无佳想;一月不读书,耳目失清爽。☆17Jul 24, 2020Updated 5 years ago
- ☆10Feb 27, 2022Updated 4 years ago
- learning to play atari games with reinforcement learning☆10Jan 4, 2016Updated 10 years ago
- JVRC1 model files for MuJoCo☆10Apr 8, 2025Updated 11 months ago
- ☆13Apr 28, 2025Updated 10 months ago