R.L. methods and techniques.
☆199 · Feb 17, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for RL
Users who are interested in RL are comparing it to the repositories listed below
- time to learn mlx ☆41 · Sep 17, 2025 · Updated 5 months ago
- Sequential Logic ☆114 · Feb 26, 2026 · Updated last week
- Documentation about the Tympan ☆14 · Jun 14, 2022 · Updated 3 years ago
- A concise text on quantum mechanics, intended for a general mathematical audience including CS, engineering, math, and physics undergrads… ☆156 · Sep 5, 2025 · Updated 6 months ago
- Craziness. ☆29 · Feb 10, 2025 · Updated last year
- Deep Reinforcement Learning: Zero to Hero! ☆2,260 · Oct 27, 2025 · Updated 4 months ago
- Do what you want, just give credit. ☆23 · Jun 13, 2025 · Updated 8 months ago
- A bot that tweets automatically ☆12 · Feb 4, 2016 · Updated 10 years ago
- PyTorch script hot swap: Change code without unloading your LLM from VRAM ☆125 · Apr 21, 2025 · Updated 10 months ago
- Agents are distributed systems, and in this repository, they are treated as such. arthur@distributed.systems for projects / employment op… ☆40 · Jun 6, 2025 · Updated 8 months ago
- ☆308 · Updated this week
- A $100 Agent - Reinforcement tuning a language model to play the game of Wordle ☆16 · Jul 14, 2025 · Updated 7 months ago
- ☆12 · May 8, 2024 · Updated last year
- Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT) ☆13 · Jun 13, 2022 · Updated 3 years ago
- Expose Datasette instances to LLM as a tool ☆26 · May 27, 2025 · Updated 9 months ago
- Visualizing movie frames as art ☆123 · Jun 21, 2020 · Updated 5 years ago
- Rewriting Principia Mathematica in Lean ☆138 · Feb 5, 2026 · Updated last month
- Plugin Marketplace for Claude Code ☆20 · Feb 8, 2026 · Updated 3 weeks ago
- A concise, beginner-friendly introduction to the core ideas of linear algebra. ☆1,900 · Feb 25, 2026 · Updated last week
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch ☆695 · Jun 14, 2025 · Updated 8 months ago
- LD_PRELOADable library for exploring the glibc heap ☆108 · Mar 6, 2025 · Updated 11 months ago
- A multithreaded discrete event simulation library in C ☆63 · Feb 22, 2026 · Updated last week
- ☆69 · Oct 17, 2025 · Updated 4 months ago
- Simple website to pad out images so they fit an aspect ratio of 16x9 ☆18 · Mar 25, 2021 · Updated 4 years ago
- Browser-LLM Auto-Scaling Technology ☆775 · Jan 29, 2026 · Updated last month
- This package implements 1D and 2D blood flow models for arterial circulation using Trixi.jl, enabling efficient numerical simulation and … ☆44 · Feb 9, 2026 · Updated 3 weeks ago
- An experimental transformer stack and symbolic computation engine built entirely from first principles in pure Python. ☆39 · Apr 18, 2025 · Updated 10 months ago
- Various shape regularization algorithms ☆89 · Jan 9, 2026 · Updated last month
- DiscoGrad - automatically differentiate across conditional branches in C++ programs ☆209 · Sep 12, 2024 · Updated last year
- Implementation of the Monte-Carlo CTW AIXI approximation as described by Joel Veness et al. ☆12 · Jan 14, 2017 · Updated 9 years ago
- Fully neural approach for text chunking ☆406 · Oct 23, 2025 · Updated 4 months ago
- Implement recursion using English as the programming language and an LLM as the runtime. ☆240 · Apr 3, 2023 · Updated 2 years ago
- Animating R1's thoughts. ☆383 · Feb 17, 2025 · Updated last year
- A short guide to LaTeX that avoids legacy cruft. ☆872 · Oct 26, 2022 · Updated 3 years ago
- US cached road graph, freeways, primary and secondary roads ☆193 · Jan 8, 2025 · Updated last year
- Topological sort library in Zig ☆97 · Dec 13, 2025 · Updated 2 months ago
- Full-featured logic programming (AKA "Prolog") embedded in/callable from and supporting calls to Clojure. In the spirit of LogLisp, Lisp… ☆270 · Apr 4, 2024 · Updated last year
- Production-ready K-Means clustering for Apache Spark with pluggable Bregman divergences (KL, Itakura-Saito, L1, etc). 6 algorithms, 740 … ☆341 · Feb 14, 2026 · Updated 2 weeks ago
- ☆21 · Jun 16, 2022 · Updated 3 years ago