This training offers an intensive exploration into the frontier of reinforcement learning techniques with large language models (LLMs). We will explore advanced topics such as Reinforcement Learning with Human Feedback (RLHF), Reinforcement Learning from AI Feedback (RLAIF), Reasoning LLMs, and demonstrate practical applications such as fine-tun…
☆66Mar 9, 2026Updated last month
Alternatives and similar repositories for oreilly-llm-rl-alignment
Users that are interested in oreilly-llm-rl-alignment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Learn how multimodal AI merges text, image, and audio for smarter models☆30Jan 21, 2025Updated last year
- Here you can find all the datasets and notebooks linked to the Automated machine & deep learning course provided by O'Reilly☆42Aug 1, 2024Updated last year
- Notebooks for the live trainining about llm app development☆84Jun 17, 2025Updated 10 months ago
- This repository contains code for the O'Reilly Live Online Training for Hands on Transfer Learning with BERT☆18Oct 20, 2022Updated 3 years ago
- ☆50Nov 26, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This repository contains code for the O'Reilly Live Online Training for BERT☆32Dec 5, 2022Updated 3 years ago
- Optimizing LLMs with Fine-Tuning and Prompt Engineering☆87Dec 16, 2025Updated 4 months ago
- Transformer Architectures for Generative AI☆105Jul 23, 2025Updated 9 months ago
- ☆39Feb 8, 2026Updated 2 months ago
- See how to augment LLMs with real-time data for dynamic, context-aware apps - Rag + Agents + GraphRAG.☆165Feb 17, 2026Updated 2 months ago
- ☆17Jul 10, 2024Updated last year
- Repo for my live-training about autogen☆28Apr 5, 2026Updated 3 weeks ago
- Repository for all the code and notebooks for the O'Reilly live-training: "Getting Started with LLM Agents using Langchain"☆57Apr 5, 2026Updated 3 weeks ago
- Examples used in the AI-Tools Course☆28Oct 19, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆27May 20, 2024Updated last year
- ☆15Aug 16, 2022Updated 3 years ago
- ☆25Apr 1, 2026Updated 3 weeks ago
- Repository for the oreilly live training course: "Getting Started with Llama2"☆96Feb 3, 2026Updated 2 months ago
- Mastering the Art of Scalable and Efficient AI Model Deployment☆143Feb 25, 2026Updated 2 months ago
- ☆58Mar 3, 2026Updated last month
- Sinan Ozdemir's ODSC West Workshop: LLMs from Protoype to Production☆14Oct 30, 2024Updated last year
- An introduction to the world of AI Agents☆268Mar 6, 2026Updated last month
- ☆53Feb 27, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆17Dec 11, 2023Updated 2 years ago
- ☆27Mar 9, 2026Updated last month
- ☆14Jul 2, 2024Updated last year
- ☆51Apr 21, 2026Updated last week
- A Crash Course in Hugging Face☆63Nov 5, 2025Updated 5 months ago
- ☆36Jul 16, 2024Updated last year
- ☆28Mar 2, 2026Updated last month
- Hands-on LLM Engineering including Agentic AI Project☆168Feb 2, 2026Updated 2 months ago
- ☆15Dec 12, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆26Jun 25, 2024Updated last year
- ☆17Apr 24, 2024Updated 2 years ago
- Designing and Deploying LLM Pipelines☆38Dec 10, 2025Updated 4 months ago
- Repo for my live-training about getting started with langchain☆80Apr 2, 2026Updated 3 weeks ago
- Tim Warner's GitHub Actions Cert Prep Class☆84Nov 26, 2024Updated last year
- ☆18May 15, 2023Updated 2 years ago
- A basic agent framework by Sinan Ozdemir☆17Jul 14, 2025Updated 9 months ago