RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks
☆245Jun 20, 2025Updated 9 months ago
Alternatives and similar repositories for RLHF_in_notebooks
Users that are interested in RLHF_in_notebooks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An LLM based shell assistant that knows your usual shell commands.☆17Jul 18, 2025Updated 9 months ago
- ☆10Jan 23, 2025Updated last year
- SoTA open-source TTS☆23Jun 17, 2025Updated 10 months ago
- A GPT agent with a Text Interface tool☆15Feb 10, 2026Updated 2 months ago
- A Python-based AI coding assistant that uses the Gemini API for code generation, file manipulation, and interactive software development …☆23Jun 28, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆14Jun 6, 2024Updated last year
- Plugin Marketplace for Claude Code☆20Feb 8, 2026Updated 2 months ago
- Central repository for my distributions figures☆16Mar 28, 2019Updated 7 years ago
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.☆283Mar 2, 2026Updated last month
- A simple interface for using Ollama with LangChain's RAGChain☆30Mar 5, 2024Updated 2 years ago
- ☆542Jul 1, 2025Updated 9 months ago
- Parallelism and preemptive concurrency for sporadic workloads☆46Dec 2, 2024Updated last year
- A MCP Server to Create MCP Server☆21Mar 4, 2025Updated last year
- Code for Findings of ACL 2021 paper "Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain …☆19Dec 16, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Key value store using the redis protocol with Postgres as a backend☆59Nov 9, 2024Updated last year
- A collection of freely-available alternatives to github copilot☆58Aug 12, 2024Updated last year
- Easy .ovpn files import/generation tool☆29Apr 19, 2020Updated 5 years ago
- A simple app for downloading YouTube Shorts transcripts. Built to self-host with Python and Streamlit. Free and open source.☆32Dec 4, 2024Updated last year
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆58May 27, 2025Updated 10 months ago
- 🔥 LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis☆104Nov 25, 2024Updated last year
- ☆54Nov 14, 2024Updated last year
- a simple social media researcher built with vercels ai sdk☆42Aug 17, 2025Updated 8 months ago
- A curated list of awesome things related to the WebMCP W3C standard☆87Mar 25, 2026Updated 3 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆167Jan 4, 2026Updated 3 months ago
- TideCloak lets your users hold their own digital authority—no central control, no blind trust.☆64Jul 28, 2025Updated 8 months ago
- ☆44Aug 21, 2025Updated 7 months ago
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch☆703Jun 14, 2025Updated 10 months ago
- Hubcap is an autonomous AI agent in 25 lines of code: a small Autobot that you can't trust. *This is the Python fork/port* from https://g…☆22Nov 10, 2025Updated 5 months ago
- A comprehensive suite of tools, built to liberate science by making the creation, evaluation, and dissemination of research more transpar…☆246Aug 8, 2025Updated 8 months ago
- VPN over UDP☆114Feb 3, 2026Updated 2 months ago
- cloudflare workers项目,开箱即用,用于记录美好瞬间✨✨☆73Oct 20, 2025Updated 5 months ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆70May 8, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Detect whether or not an audio file was generated by NotebookLM☆142Nov 30, 2024Updated last year
- A simple to use text only blog using CloudFlare Workers and KV☆86Oct 26, 2024Updated last year
- An MCP server for playing Minesweeper☆108Mar 20, 2025Updated last year
- ☆261Mar 27, 2024Updated 2 years ago
- Simple Page Ordering plugin for WordPress☆12Apr 2, 2018Updated 8 years ago
- An N-Body simulation built in Godot☆11Jul 14, 2021Updated 4 years ago
- Tidal Cycles Code Files☆11Mar 9, 2025Updated last year