Implementations of various RL and Deep RL algorithms in TensorFlow, PyTorch and Keras.
☆16Sep 18, 2024Updated last year
Alternatives and similar repositories for reinforcement-learning
Users that are interested in reinforcement-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A collection of utilities for machine learning experiments.☆11Jan 8, 2026Updated 3 months ago
- Official Implementation of SFM and the baselines in Jax.☆21May 31, 2025Updated 10 months ago
- CVE-2019-0708 Exploit Tool☆18Jul 18, 2019Updated 6 years ago
- ☆11Feb 16, 2025Updated last year
- SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis☆30Aug 19, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- PyTorch Implementation of Hamilton-Jacobi DQN☆16May 12, 2021Updated 4 years ago
- Multi-Critic Policy Gradient Optimization for Quadcopter Coordination☆14Aug 10, 2021Updated 4 years ago
- Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".☆24Nov 8, 2024Updated last year
- A repository hosting some of my own vulnerability reports and proof-of-concepts.☆15Aug 8, 2019Updated 6 years ago
- data processing code for MIMIC-IV 2.2☆14Jan 26, 2024Updated 2 years ago
- Simple demo for Databricks!☆14Sep 11, 2023Updated 2 years ago
- Personal Stock Tracker for COS Online Store☆10Updated this week
- [MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003☆11Oct 6, 2022Updated 3 years ago
- 个人网站☆12Mar 22, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10Nov 16, 2020Updated 5 years ago
- WebTranport module that libp2p uses and that implements the interface-transport spec☆15Aug 9, 2023Updated 2 years ago
- Simple implementation of object detection from video using opencv dnn, tensorflow, and pytorch☆10Sep 17, 2019Updated 6 years ago
- Ebuilds to install ROCM on Gentoo Linux☆37Jan 14, 2023Updated 3 years ago
- This is a project based on OpenAI's multi-agent-emergence-environments (Emergent Tool Use from Multi-Agent Autocurricula, Baker et al.), …☆13Jan 5, 2021Updated 5 years ago
- Stateless CLI tool to easily pin CAR files to IPFS pinning services. Client for the IPFS Pinning Service API that speaks HTTP and Bitswap…☆16Dec 15, 2023Updated 2 years ago
- Revisiting Rainbow☆76Jun 9, 2021Updated 4 years ago
- A comparison of RCN/CNN/SVM/KNN on EMNIST-letters dataset☆10Dec 18, 2017Updated 8 years ago
- Smart grid pricing by reinforcement learning☆19Dec 19, 2018Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A Programming language of directories. Just for fun.☆15Feb 2, 2023Updated 3 years ago
- Light Seeking and Obstacle Avoiding Robot☆10Feb 7, 2017Updated 9 years ago
- Various python scripts for reinforcement learning algorithms.☆10Aug 7, 2018Updated 7 years ago
- Template for Vite.js docs translation repositories☆14Updated this week
- Get CLIP ViT text tokens about an image, visualize attention as a heatmap.☆15Aug 8, 2023Updated 2 years ago
- This repository contains the necessary files for my profile's README. This includes multiple GitHub Actions as well as dynamic content.☆12Updated this week
- CVE-2022-1040☆18Sep 25, 2022Updated 3 years ago
- ☆12Aug 1, 2023Updated 2 years ago
- Rust grammar for Lezer☆22Feb 14, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Evolving expressions using genetic algorithms☆17Jun 20, 2020Updated 5 years ago
- Testing different RL algorithms for multi-agent environments. From SARSA, QLearning to Independent Q-Learning, Joint Action Learning and …☆12Mar 29, 2019Updated 7 years ago
- ☆18Oct 16, 2022Updated 3 years ago
- Wavefront .obj loader&viewer with GUI controls. Written in C++/OpenGL, supports .mtl.☆21Jul 11, 2015Updated 10 years ago
- Course materials for a 3-day seminar "Machine Learning and NLP: Advances and Applications" at New College of Florida☆12Feb 10, 2022Updated 4 years ago
- Compare Q-Learning and Expected Value SARSA.☆11Oct 7, 2018Updated 7 years ago
- Solving MuJoCo environments with Deep Deterministic Policy Gradients☆14Sep 17, 2018Updated 7 years ago