Implementation of all RL algorithms in a simpler way
☆1,404Aug 29, 2025Updated 6 months ago
Alternatives and similar repositories for all-rl-algorithms
Users who are interested in all-rl-algorithms are comparing it to the libraries listed below.
- Large Language Model in Action☆342Jan 28, 2025Updated last year
- Simple single file implementations of Reinforcement Learning algorithms in Julia☆23Feb 15, 2025Updated last year
- Building DeepSeek R1 from Scratch☆747Mar 21, 2025Updated 11 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆597Oct 7, 2025Updated 4 months ago
- Train a 29M parameter GPT from Scratch☆33Mar 4, 2025Updated last year
- High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, T…☆9,213Jul 8, 2025Updated 7 months ago
- Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary h…☆892Updated this week
- GitHub repository for the book 『今日から使えるファインチューニングレシピ』 ("Fine-Tuning Recipes You Can Use Starting Today").☆13Sep 11, 2024Updated last year
- Minimal and annotated implementations of key ideas from modern deep learning research.☆1,242Jan 29, 2026Updated last month
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks☆232Jun 20, 2025Updated 8 months ago
- This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."☆14,782Feb 22, 2026Updated last week
- LeanRL is a fork of CleanRL, where selected PyTorch scripts are optimized for performance using compile and cudagraphs.☆677Aug 22, 2025Updated 6 months ago
- Awesome LLM pre-training resources, including data, frameworks, and methods.☆338Apr 29, 2025Updated 10 months ago
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆23,193Dec 17, 2025Updated 2 months ago
- Turn topics into essays in seconds!☆192Jul 6, 2025Updated 8 months ago
- A concise list for mcp servers☆868Aug 7, 2025Updated 7 months ago
- Various reinforcement learning algorithms written in Jax + Flax☆26Jun 24, 2023Updated 2 years ago
- Self improving agentic rag pipeline☆122Nov 13, 2025Updated 3 months ago
- This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information r…☆25,723Feb 17, 2026Updated 2 weeks ago
- Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision☆320Jan 25, 2026Updated last month
- Interactive Pytorch forward pass visualization in notebooks☆710Feb 4, 2026Updated last month
- ☆10Feb 14, 2025Updated last year
- A Simple, Explainable Vision Language Model for detecting manufacturing defects in products☆14Sep 23, 2025Updated 5 months ago
- RL Environments in JAX 🌍☆868May 30, 2025Updated 9 months ago
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆682May 20, 2025Updated 9 months ago
- An implementation of AlphaZero and MCTS with neural networks for Tetris☆22Mar 21, 2025Updated 11 months ago
- When in doubt, Vibe mechanics! One-Person Company AI Tools Series – continuously updated to help boost productivity and empower your solo business!☆2,645May 8, 2025Updated 9 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆237Nov 24, 2025Updated 3 months ago
- An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.☆98Jun 4, 2025Updated 9 months ago
- vyai – A lightweight CLI tool to interact with the Gemini API from the terminal.☆11Dec 8, 2025Updated 2 months ago
- An index of the LangChain + LangGraph ecosystem: concepts, projects, tools, templates, and guides for LLM & multi-agent apps.☆1,522Feb 22, 2026Updated last week
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model☆77Aug 18, 2025Updated 6 months ago
- Learn about the fundamentals of LangGraph through a series of notebooks☆331Updated this week
- A very quick project that transforms research papers into engaging three-person discussions, offering an intuitive and thought-provoking …☆606Dec 9, 2024Updated last year
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL☆396Updated this week
- ☆169Oct 31, 2024Updated last year
- ML from scratch☆2,442Aug 12, 2025Updated 6 months ago
- This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a…☆20,243Feb 17, 2026Updated 2 weeks ago
- Maximizing the Performance of a Simple RAG using RL☆90Mar 20, 2025Updated 11 months ago