π€ Implements of Reinforcement Learning algorithms.
β117Apr 1, 2018Updated 7 years ago
Alternatives and similar repositories for Reinforcement-Learning
Users that are interested in Reinforcement-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- πΈ Papers and Code Implements for Quantitative-Tradingβ38May 9, 2018Updated 7 years ago
- π Personae is a repo of implements and environment of Deep Reinforcement Learning & Supervised Learning for Quantitative Trading.β1,401Nov 29, 2018Updated 7 years ago
- Fixed-point arithmetic in C++β12Sep 25, 2013Updated 12 years ago
- Implementation of Structural Correspondence Learningβ15Apr 22, 2018Updated 7 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.β154May 28, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- R package for Bayesian quantile vector autoregression estimation, forecast and impulse response analysisβ11Oct 10, 2024Updated last year
- A converter for Euler Angle,Axis Angle,Quaternion,Rotation Matrix.β16Jun 9, 2021Updated 4 years ago
- Multiple object tracking using Kalman filters and Munkres algorithmβ13Jun 7, 2017Updated 8 years ago
- Augmentation scripts for the bAbI Dialog Tasks datasetβ13Oct 16, 2018Updated 7 years ago
- practiceβ10Jun 30, 2020Updated 5 years ago
- β15Sep 25, 2020Updated 5 years ago
- Playing with an LSTM neural network.β32Oct 31, 2017Updated 8 years ago
- Quadratic Programming for Continuous Control of Safety-Critical Multi-Agent Systems Under Uncertaintyβ14Sep 7, 2024Updated last year
- Waste Sorting with Robot Arm Tossingβ10Sep 19, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A path planning framework based on Sampling-based algorithm and Deep Reinforcement learning.β10May 9, 2023Updated 2 years ago
- Official implementation of AGSTN model(ICDM2020)β12Sep 12, 2020Updated 5 years ago
- Mxnet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPOβ28Dec 8, 2022Updated 3 years ago
- β18Jan 7, 2019Updated 7 years ago
- The continuous mountain car problem solved with DDPGβ13Apr 19, 2020Updated 5 years ago
- Autonomous Driving on Carla simulator using Deep Deterministic Policy Gradients. Based on Kendall, et. al. 2018.β12Apr 2, 2019Updated 6 years ago
- Spectral Feature Alignmentβ10Oct 6, 2016Updated 9 years ago
- ζΊε¨ε¦δΉ ειεεζε¦δΉ θΏθ‘δΈβ380Feb 3, 2018Updated 8 years ago
- PyTorch bindings for openai-gemmβ20Feb 6, 2017Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym frameworkβ11May 29, 2021Updated 4 years ago
- laterβ10Jul 9, 2022Updated 3 years ago
- [Reproduce] Code for the EMNLP2018 paper "A Visual Attention Grounding Neural Model for Multimodal Machine Translation".β11Jan 19, 2020Updated 6 years ago
- LibCP -- A Library for Conformal Predictionβ13Feb 26, 2015Updated 11 years ago
- Unbounded cache model for online language modeling with open vocabularyβ11Feb 15, 2019Updated 7 years ago
- η»ζ΅ε¦AIδ½Ώη¨ζ εβ18Sep 10, 2025Updated 6 months ago
- Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelinesβ19Dec 8, 2023Updated 2 years ago
- Reinforcement learning algorithm implementations and ML experimentation workspaceβ44Jun 8, 2019Updated 6 years ago
- Strassen's Algorithm for Tensor Contractionβ15Jul 7, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- β16Feb 22, 2024Updated 2 years ago
- Code for phonetically classifying TIMIT using TensorFlowβ18Jul 1, 2016Updated 9 years ago
- Reinforcement Learning in continuous state and action spaces. DDPG: Deep Deterministic Policy Gradient and A3C: Asynchronous Actor-Criticβ¦β14May 14, 2018Updated 7 years ago
- Notebooks for Scaling Deep Learning Interpretability by Visualizing Activation and Attribution Summarizationsβ15Oct 3, 2019Updated 6 years ago
- Tensorflow Reproduction of the EMNLP-2018 paper "Temporally Grounding Natural Sentence in Video"β17Nov 21, 2022Updated 3 years ago
- β19Feb 19, 2016Updated 10 years ago
- Simple and self-contained TensorFlow implementation of reinforcement learning algorithms for continuous control, integrated with OpenAI Gβ¦β11Jun 4, 2020Updated 5 years ago