ALucek / three-RL-projectsLinks
Learn RL Techniques in 3 Easy Projects
☆17Updated last year
Alternatives and similar repositories for three-RL-projects
Users that are interested in three-RL-projects are comparing it to the libraries listed below
Sorting:
- ☆75Updated 8 months ago
- ☆26Updated last year
- PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research …☆206Updated 5 months ago
- Repository of implementations of classic and sota rl algorithms from scratch in PyTorch☆219Updated 3 weeks ago
- Cookbooks for AI Agents☆151Updated 8 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated last month
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆77Updated 9 months ago
- ☆101Updated 8 months ago
- One click templates for inferencing Language Models☆227Updated 2 months ago
- Fine tuning ModernBERT-embed-base on synthetic domain specific data for improvement to unseen queries☆51Updated 8 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆400Updated 2 months ago
- Files for 'Build Your Own AI Research Agent' Lightning Lesson☆29Updated 8 months ago
- ☆10Updated last year
- ☆76Updated 6 months ago
- ☆44Updated last year
- API Server for Transformer Lab☆82Updated 2 months ago
- Learn the building blocks of how to build gpt-oss from scratch☆110Updated 4 months ago
- System built to find your lookalike with AI☆48Updated last year
- A collection of the the best ML and AI news every week (research, news, resources)☆173Updated 6 months ago
- ☆181Updated 2 years ago
- Open-source medical-compliant AI Assistant and AI Agent Orchestration Engine with marketplace☆21Updated 8 months ago
- Reference implementation of Mistral AI 7B v0.1 model.☆28Updated 2 years ago
- This repository contains an exhaustive coverage of a hands on approach to PyTorch along side powerful tools to accelerate model tuning an…☆226Updated last month
- A reimplementation of langgraph's customer support example in Rasa's CALM paradigm and a quantiative evaluation of the 2 approaches☆81Updated 10 months ago
- Finetune Llama-3-8b on the MathInstruct dataset☆115Updated last year
- Fine tune Gemma 3 on an object detection task☆96Updated 6 months ago
- ☆75Updated last year
- everything i know about cuda and triton☆13Updated last year
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆16Updated 10 months ago
- ☆90Updated 2 years ago