A high throughput, end-to-end RL library for infinite-horizon tasks.
☆23Oct 22, 2025Updated 4 months ago
Alternatives and similar repositories for rl8
Users that are interested in rl8 are comparing it to the libraries listed below
Sorting:
- JAX implementations of various deep reinforcement learning algorithms.☆26Feb 2, 2025Updated last year
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- This is an efficient implementation of Proximal Policy Optimization in C++ LibTorch adapted from the wonderful Python implementation by: …☆13May 2, 2025Updated 10 months ago
- Airlift Challenge starter kit☆10Apr 18, 2025Updated 10 months ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- MiTMoJCo (Microscopic Tunneling Model for Josephson Contacts) is C and Python code for simulating dynamics of superconducting Josephson j…☆10Feb 9, 2023Updated 3 years ago
- React wrapper for daisyUI☆10Mar 12, 2022Updated 3 years ago
- Table of ZhengMa input method for IBus-Table☆19Jun 15, 2009Updated 16 years ago
- Gymnasium environment for research of UAVs and risk constraints☆12Oct 29, 2024Updated last year
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆27Oct 16, 2025Updated 4 months ago
- A Texas Holdem poker framework written in C++ 20.☆11Apr 23, 2023Updated 2 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- Sudoku solver in Golang☆10Sep 6, 2020Updated 5 years ago
- TorchRL is a C++ reinforcement library using PyTorch C++ backend LibTorch☆10Jul 20, 2022Updated 3 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- [RAL 2025] MTIL: Encoding Full History with Mamba for Temporal Imitation Learning☆27Nov 17, 2025Updated 3 months ago
- CFR-based Texas Hold'em AI☆11Jan 30, 2021Updated 5 years ago
- ☆11Feb 29, 2024Updated 2 years ago
- a minimalistic todo app☆10May 10, 2023Updated 2 years ago
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice☆12Aug 30, 2020Updated 5 years ago
- MLflow App Using React, Hooks, RabbitMQ, FastAPI Server, Celery, Microservices☆11Sep 25, 2022Updated 3 years ago
- nd009-cn-advanced-p5,针对Udacity CN MLND P5项目☆14Jun 27, 2022Updated 3 years ago
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- Furnace is a high-performance quantitative trading library that provides features similar to CCXT, allowing developers to connect and int…☆14Jan 16, 2025Updated last year
- tmux cheatsheet in terminal friendly text format.☆10Jan 7, 2022Updated 4 years ago
- A chrome extension for improving the ChatGPT UI☆10Apr 14, 2023Updated 2 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 2 years ago
- Just another static site generator -> あなたが恋しいです。☆11Dec 5, 2023Updated 2 years ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- Swarm learning algorithm☆11Jun 2, 2021Updated 4 years ago
- Poker hand evaluation for Go☆12Feb 7, 2014Updated 12 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- ☆13Mar 7, 2024Updated last year
- A Simple Game Using Unity ML-Agents☆10Nov 20, 2020Updated 5 years ago