A repo to design basic Policy Gradient labs
☆12Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for Basic-Policy-Gradient-Labs
Users that are interested in Basic-Policy-Gradient-Labs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Notebook for the Bayesian Optimization (quick) tutorial☆10Apr 7, 2021Updated 5 years ago
- Jupyter notebook for the MAP-Elites algorithms (Mouret & Clune, 2015)☆24Jul 9, 2022Updated 3 years ago
- This repository contains code for a tutorial on end to end automatic speech recognition.☆18Sep 10, 2019Updated 6 years ago
- ☆12Apr 18, 2023Updated 3 years ago
- Labs for understanding and coding Standard Reinforcement Learning concepts☆60Jan 17, 2019Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- My solution to Collaboration and Competition using MADDPG algorithm, Udacity 3rd project of Deep RL Nanodegree from the paper "Multi-Agen…☆10Oct 6, 2019Updated 6 years ago
- Non-orthogonal multiple access (NOMA) for Indoor Visible Light Communications. We offer a complete review of PD-NOMA-based VLC systems in…☆17Oct 18, 2023Updated 2 years ago
- Archive of my older research papers on optimization☆10Jan 20, 2021Updated 5 years ago
- Traffic Steering (TS) xApp for OAIC O-RAN Testbed☆12Nov 8, 2023Updated 2 years ago
- a modular reinforcement learning library with JAX agents☆27Mar 3, 2025Updated last year
- Environments with IC3Net paper☆15Jan 8, 2019Updated 7 years ago
- R package for tracking Covid19 cases in San Francisco☆12Apr 2, 2023Updated 3 years ago
- Standard interface for entity based reinforcement learning environments.☆38Feb 28, 2024Updated 2 years ago
- Decentralized deep multi-agent reinforcement learning in physical environments.☆14Aug 19, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 6 months ago
- Heterogeneous effects analysis of conjoint experiments using BART☆10Sep 6, 2023Updated 2 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆12Feb 22, 2024Updated 2 years ago
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- 统计微信朋友圈送出的赞票与得到的赞票人员比例☆11May 3, 2016Updated 9 years ago
- Welcome to the Machine Learning Engineering Repository, a comprehensive collection of resources, code, and insights to guide you through…☆25Feb 25, 2025Updated last year
- Official repo for vidar and vidarc: video foundation model for robotics.☆40Dec 22, 2025Updated 4 months ago
- Loss-Balanced Task Weighting to Reduce Negative Transfer in Multi-Task Learning, AAAI-SA'19☆30Sep 23, 2019Updated 6 years ago
- ☆24Dec 30, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- Factored Interactive POMDP solver based on symbolic Perseus.☆11Aug 12, 2025Updated 8 months ago
- Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.☆34Jan 23, 2021Updated 5 years ago
- Model Behavior Study Group☆29Mar 20, 2026Updated last month
- Neo4j 大规模 三元组 CVS 导入进数据库☆11Jul 31, 2020Updated 5 years ago
- Implementation of DeDOL algorithm - Deep Reinforcement Learning based algorithm for Green Security Games with Real Time Information☆16Nov 7, 2019Updated 6 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆15May 28, 2025Updated 11 months ago
- This repository contains all code and experiments for competitive policy gradient (CoPG) algorithm.☆24Aug 1, 2020Updated 5 years ago
- Basic PyTorch Implementation of 'Neural Architecture Search with Reinforcement Learning' (https://arxiv.org/abs/1611.01578)☆13Feb 24, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official repository for the paper "Automating Continual Learning"☆18Jun 11, 2025Updated 10 months ago
- A deep reinforcement learning based approach is used to allocate downlink power for multi-cell wireless system.☆23Feb 21, 2020Updated 6 years ago
- 🛩 Use Deep Reinforcement Learning Algorithms in a simple scene.☆18Jun 18, 2020Updated 5 years ago
- small african spatial datasets for learning & teaching mapping in R☆17Nov 9, 2021Updated 4 years ago
- 2nd place submission to the MEG decoding competition https://www.kaggle.com/c/decoding-the-human-brain☆18Aug 5, 2014Updated 11 years ago
- ☆120Jul 9, 2020Updated 5 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 9 years ago