☆17Mar 4, 2019Updated 7 years ago
Alternatives and similar repositories for reinforcement-learning
Users that are interested in reinforcement-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Jun 16, 2021Updated 4 years ago
- Hardware Accelerated MWPM decoder for Quantum Error Correction☆22Mar 23, 2025Updated last year
- A place to store my knowledge base☆12Apr 27, 2026Updated last month
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆11Feb 28, 2023Updated 3 years ago
- Demo of exporting HTML content as PDFs using various html-to-pdf libraries☆10Aug 22, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆12Aug 28, 2020Updated 5 years ago
- Electroplating simulation environment☆20Sep 26, 2024Updated last year
- Distributional Successor Features Enable Zero-Shot Policy Optimization☆15Apr 11, 2025Updated last year
- RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning☆19May 24, 2023Updated 3 years ago
- ☆13May 21, 2023Updated 3 years ago
- ☆15Jan 6, 2024Updated 2 years ago
- Rewrite the raft algorithm☆11Dec 20, 2020Updated 5 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆11Aug 13, 2023Updated 2 years ago
- PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment☆12Feb 22, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Oct 23, 2025Updated 7 months ago
- ☆16Aug 2, 2022Updated 3 years ago
- ☆11Jan 24, 2022Updated 4 years ago
- sequential learning in orthogonal subspaces☆14Nov 20, 2020Updated 5 years ago
- task manager framework in js☆13Jan 10, 2016Updated 10 years ago
- [ICML 2025] Improving Planning of Agents for Long-Horizon Tasks☆40Oct 2, 2025Updated 8 months ago
- Parallel Particle Swarm Optimizer on the Spark Clustering Computing Platform.☆12Oct 29, 2018Updated 7 years ago
- Pytorch Implementation of Learning Latent Dynamic Robust Representations for World Models☆25May 11, 2024Updated 2 years ago
- This is the source code using deep Q learning for calculate UAV resource allocation☆39Aug 28, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 3D Tensor Network Decoding☆21Mar 20, 2025Updated last year
- This repo contains an implementation of the Simple-Update Tensor Network algorithm as described in the paper - A universal tensor network…☆27May 3, 2025Updated last year
- 一个尝试固液耦合的沙盒玩具☆11Feb 17, 2025Updated last year
- An educational tool to the introduction of Quantum Error Correction (QEC)☆27Mar 1, 2026Updated 3 months ago
- Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".☆24Nov 8, 2024Updated last year
- Bayes-Adaptive RL for LLM Reasoning☆45May 28, 2025Updated last year
- Pytorch实现的NMS和Soft-NMS,可直接使用yolov5官方开源的代码中☆22Mar 22, 2022Updated 4 years ago
- 遗传算法解决旅行商问题☆20Nov 19, 2022Updated 3 years ago
- 平时学习时的相关知识点与生活小技巧。☆24Feb 15, 2019Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A simple flappy bird clone written in golang.☆14Jul 7, 2015Updated 10 years ago
- Realistic water rendering using shaders in OpenGL. Inspired from Evan Wallace's WebGL water rendering.☆11Jan 7, 2019Updated 7 years ago
- N-Back Task Games designed to improve working memory and cognitive abilities.☆29Mar 2, 2023Updated 3 years ago
- Hypergraph Minimum-Weight Parity Factor (MWPF) Algorithm for Decoding General Quantum LDPC Codes☆45Feb 12, 2026Updated 4 months ago
- ☆20Mar 15, 2022Updated 4 years ago
- 基于原生前端和 Python Flask 后端的文件服务器,可远程查看、下载和上传文件,局域网搭配内网穿透可实现公网访问☆21Apr 9, 2023Updated 3 years ago
- 🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL☆59Aug 24, 2025Updated 9 months ago