☆29Oct 10, 2018Updated 7 years ago
Alternatives and similar repositories for Reinforcement-Learning
Users that are interested in Reinforcement-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep Reinforcement Learning for Dynamic Multicahnnel Access in Wireless Networks☆14Oct 1, 2017Updated 8 years ago
- ☆10Dec 29, 2020Updated 5 years ago
- Reinforcement Leanring Algorithms Trained with Unity☆13Apr 26, 2019Updated 7 years ago
- DQN implemented in keras with Dueling Network and Prioritized Experience Replay☆16Nov 21, 2018Updated 7 years ago
- Tool to help analyze mptcp pcaps☆21Oct 8, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- pytorch, noisy_distributional_double_dueling_PER_RNN_CNN...CartPole-v1 , Acrobot-v1, MountainCar-v0☆14Mar 19, 2018Updated 8 years ago
- Lightweight flow-level simulator for inter-node network and service coordination (e.g., in cloud/edge computing or NFV).☆61May 4, 2023Updated 3 years ago
- The NS-3 simulation code for MPTCP(Multiple Path TCP) in 802.11ad WiGig and Wi-Fi☆16Sep 26, 2023Updated 2 years ago
- 本项目致力于多人合作实现强化学习用于交通信号灯控制领域,代码将同步更新☆12Mar 11, 2019Updated 7 years ago
- ☆20Mar 4, 2018Updated 8 years ago
- The implementation of PStream based on the mpquic project☆11May 9, 2020Updated 6 years ago
- Reinforcement Learning Algorithms with Unity 3D Environments☆18Jul 15, 2019Updated 6 years ago
- A jax/stax implementation of: Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M.,…☆10Dec 7, 2020Updated 5 years ago
- RL scheduler for MPQUIC☆16Sep 26, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 2048 environment for Reinforcement Learning and DQN algorithm☆40May 27, 2022Updated 3 years ago
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆11May 4, 2024Updated 2 years ago
- ☆15Jun 18, 2023Updated 2 years ago
- a q-learning algorithms on packet routing.☆14Dec 1, 2018Updated 7 years ago
- Deep Recurrent Q-Learning vs Deep Q Learning on a simple Partially Observable Markov Decision Process with Minecraft☆49Apr 12, 2019Updated 7 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆452Oct 21, 2021Updated 4 years ago
- Tensorflow implementation of proximal policy optimization (PPO) algorithm☆13Feb 28, 2018Updated 8 years ago
- Reproduce the Simulation for Software Defined Network(SDN) based on Omnetpp-5.4.1 and Reinforcement Learning Algorithm in paper "QoS-Awar…☆48Feb 16, 2022Updated 4 years ago
- Some scripts to turn an OpenWrt router into a passive find3 scanner☆26Oct 11, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 车联网数据收集与分析平台,主要使用:MQTT+Kafka+KSQL+Tensorflow(使用MQTT作为传输协议,EMQ作为MQTT Broker,Kafka作为消息中间件存储数据,KSQL作为流处理工具,Tensorflow用于数据分析模型预测)☆17May 31, 2020Updated 5 years ago
- ☆11Oct 22, 2020Updated 5 years ago
- ☆17Nov 19, 2024Updated last year
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- Pathfinding Using Reinforcement Learning☆12May 21, 2019Updated 7 years ago
- The hyper-parameters tuning and black box optimization games☆13Apr 20, 2023Updated 3 years ago
- Starting with Bi-Directional LSTMS☆19Mar 21, 2018Updated 8 years ago
- FlexiBO: Cost-Aware Multi-Objective Optimization of Deep Neural Networks☆14Mar 12, 2023Updated 3 years ago
- Monte Carlo Conterfactual Regret Minimization for imperfect information games☆13Mar 29, 2019Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains☆12Oct 16, 2017Updated 8 years ago
- PPDRL: a Pretraining-and-policy Based Deep Reinforcement Learning Approach for QoS-aware Service Composition☆13May 31, 2019Updated 6 years ago
- simulation of "A novel reinforcement learning algorithm for virtual network emb e dding" paper☆18Jan 16, 2020Updated 6 years ago
- Generate robust counterfactual explanations for machine learning models☆17Jun 8, 2023Updated 2 years ago
- 这里是鲁鹏老师团队的官方小站 (CV学吧)的根据地,欢迎!This is the official station of CV-xueba. Welcome ! ^_^☆15Jun 7, 2025Updated 11 months ago
- SIADEX - An HTN planner with temporal, partial order planning☆14Apr 19, 2023Updated 3 years ago
- Solving a Battleship board game as an optimization problem☆12Jun 21, 2022Updated 3 years ago