Deep Reinforcement Learning Algorithms Implementation in PyTorch
☆27 · Feb 11, 2025 · Updated last year
Alternatives and similar repositories for rl_pytorch
Users that are interested in rl_pytorch are comparing it to the libraries listed below.
- Meme serving with NLP ☆35 · May 20, 2023 · Updated 2 years ago
- Simulate binance fee mechanism by RL agents ☆33 · Sep 20, 2019 · Updated 6 years ago
- ☆20 · Apr 10, 2018 · Updated 8 years ago
- Today I Learned ☆23 · Mar 8, 2020 · Updated 6 years ago
- Collection of reinforcement learning algorithms ☆16 · Sep 29, 2025 · Updated 7 months ago
- Implement IMPALA architecture from Distributed Deep-RL Paper. ☆15 · Oct 18, 2018 · Updated 7 years ago
- [Developmental] Quarto Extension to Enable Google Colaboratory Links with Quarto Documents ☆17 · May 18, 2025 · Updated 11 months ago
- weekly reinforcement learning paper reviews ☆33 · Jan 8, 2018 · Updated 8 years ago
- Example plots ☆13 · Sep 1, 2024 · Updated last year
- Continual Reinforcement Learning in 3D Non-stationary Environments ☆39 · Jun 16, 2019 · Updated 6 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid ☆12 · Jun 19, 2019 · Updated 6 years ago
- The ALL Arduino Nano 33 BLE Sense Classifier is an experiment to explore how low powered microcontrollers, specifically the Arduino Nano … ☆10 · Jul 21, 2021 · Updated 4 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DPG (not DQN)) ☆14 · Mar 22, 2019 · Updated 7 years ago
- ☆131 · Apr 9, 2026 · Updated 3 weeks ago
- Experiment utility code, specifically designed for use with Compute Canada. ☆11 · Jan 27, 2025 · Updated last year
- Random Network Distillation pytorch ☆261 · Mar 4, 2019 · Updated 7 years ago
- Re-implementation of Progressive Neural Networks with PyTorch ☆15 · Jul 25, 2024 · Updated last year
- The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance an… ☆14 · May 6, 2023 · Updated 2 years ago
- An implementation of MuZero in JAX. ☆58 · Nov 8, 2022 · Updated 3 years ago
- ☆26 · Feb 26, 2026 · Updated 2 months ago
- ☆15 · Apr 7, 2024 · Updated 2 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning" ☆12 · Feb 22, 2024 · Updated 2 years ago
- Contextual Bandits Action Elimination DQN ☆21 · Jun 25, 2018 · Updated 7 years ago
- ☆27 · Apr 30, 2019 · Updated 7 years ago
- Implementation for ACER in tensorflow and sonnet by deepmind ☆11 · Aug 28, 2017 · Updated 8 years ago
- Source code for "A deep dive into reinforcement learning" ☆13 · Dec 17, 2019 · Updated 6 years ago
- Repository for codes of 'Deep Reinforcement Learning' ☆218 · Oct 4, 2019 · Updated 6 years ago
- Code for SPIBB-DQN and Soft-SPIBB-DQN ☆11 · May 5, 2020 · Updated 5 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016) ☆48 · Nov 30, 2018 · Updated 7 years ago
- This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot. ☆18 · Apr 20, 2023 · Updated 3 years ago
- pytorch maml with Multi-GPUs, fast and simplest implementation ☆13 · Dec 4, 2020 · Updated 5 years ago
- A model that generates a recipe from a dish name and ingredient names, built with reference to the ratsnlp, KOGPT2, and recipegpt GitHub repositories. ☆11 · Dec 28, 2021 · Updated 4 years ago
- Arduino TinyML trash classification example ☆10 · Dec 21, 2020 · Updated 5 years ago
- Soft Actor-Critic ☆158 · Mar 13, 2018 · Updated 8 years ago
- DARP+STC algorithm for mCPP problem ☆16 · Mar 29, 2019 · Updated 7 years ago
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3. ☆13 · Jul 11, 2022 · Updated 3 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. ☆21 · Mar 5, 2021 · Updated 5 years ago
- ☆57 · Mar 27, 2019 · Updated 7 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics ☆376 · Oct 15, 2021 · Updated 4 years ago