A repo to design basic Policy Gradient labs
☆12Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for Basic-Policy-Gradient-Labs
Users that are interested in Basic-Policy-Gradient-Labs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Jupyter notebook for the MAP-Elites algorithms (Mouret & Clune, 2015)☆25Jul 9, 2022Updated 3 years ago
- ☆12Apr 18, 2023Updated 3 years ago
- Non-orthogonal multiple access (NOMA) for Indoor Visible Light Communications. We offer a complete review of PD-NOMA-based VLC systems in…☆17Oct 18, 2023Updated 2 years ago
- Archive of my older research papers on optimization☆10Jan 20, 2021Updated 5 years ago
- Traffic Steering (TS) xApp for OAIC O-RAN Testbed☆12Nov 8, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- a modular reinforcement learning library with JAX agents☆27Mar 3, 2025Updated last year
- Environments with IC3Net paper☆15Jan 8, 2019Updated 7 years ago
- Standard interface for entity based reinforcement learning environments.☆39Feb 28, 2024Updated 2 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- ☆17Nov 2, 2024Updated last year
- My Body Is A Cage☆41Apr 13, 2021Updated 5 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Jun 24, 2026Updated last week
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆12Feb 22, 2024Updated 2 years ago
- 统计微信朋友圈送出的赞票与得到的赞票人员比例☆11May 3, 2016Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Welcome to the Machine Learning Engineering Repository, a comprehensive collection of resources, code, and insights to guide you through…☆25Feb 25, 2025Updated last year
- Official repo for vidar and vidarc: video foundation model for robotics.☆42Dec 22, 2025Updated 6 months ago
- Loss-Balanced Task Weighting to Reduce Negative Transfer in Multi-Task Learning, AAAI-SA'19☆30Sep 23, 2019Updated 6 years ago
- ☆24Dec 30, 2022Updated 3 years ago
- 高雄 python 社群活動整理☆10Apr 5, 2019Updated 7 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- Factored Interactive POMDP solver based on symbolic Perseus.☆11Aug 12, 2025Updated 10 months ago
- Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.☆35Jan 23, 2021Updated 5 years ago
- A JAX-based framework for genetic programming☆32Jun 11, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Neo4j 大规模 三元组 CVS 导入进数据库☆11Jul 31, 2020Updated 5 years ago
- Implementation of DeDOL algorithm - Deep Reinforcement Learning based algorithm for Green Security Games with Real Time Information☆16Nov 7, 2019Updated 6 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆16May 28, 2025Updated last year
- This repository contains all code and experiments for competitive policy gradient (CoPG) algorithm.☆24Aug 1, 2020Updated 5 years ago
- Yet Another Agents Framework - An RL research-oriented framework for agent prototyping and evaluation☆18Oct 9, 2023Updated 2 years ago
- Code for the PAC-Bayes Control paper.☆13May 23, 2023Updated 3 years ago
- Basic PyTorch Implementation of 'Neural Architecture Search with Reinforcement Learning' (https://arxiv.org/abs/1611.01578)☆13Feb 24, 2018Updated 8 years ago
- Official repository for the paper "Automating Continual Learning"☆20Jun 11, 2025Updated last year
- A deep reinforcement learning based approach is used to allocate downlink power for multi-cell wireless system.☆23Feb 21, 2020Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 🛩 Use Deep Reinforcement Learning Algorithms in a simple scene.☆18Jun 18, 2020Updated 6 years ago
- ☆119Jul 9, 2020Updated 5 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 9 years ago
- A tutorial for the famous non dominated sorting genetic algorithm II, multiobjective evolutionary algorithm.☆17Aug 3, 2020Updated 5 years ago
- Deep Q network-based power allocation for multi-cell massive MIMO cellular network.☆21Apr 11, 2026Updated 2 months ago
- This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).☆21Feb 17, 2025Updated last year
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago