A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and it’s applications in Large Language Models from scratch.
☆107Nov 7, 2025Updated 4 months ago
Alternatives and similar repositories for rlhf-from-scratch
Users that are interested in rlhf-from-scratch are comparing it to the libraries listed below
Sorting:
- Parses BGP/AS data from multiple different sources☆11Dec 4, 2021Updated 4 years ago
- Bash script to download all lecture videos & notes for a particular course on Coursera.org.☆13Dec 25, 2015Updated 10 years ago
- Topology library for Coq☆12Dec 24, 2015Updated 10 years ago
- Create microVMs from OCI images☆35Feb 9, 2026Updated 3 weeks ago
- Experiments with syntax and symbols☆15Mar 5, 2019Updated 7 years ago
- Stan Jupyter Magic☆11Oct 6, 2018Updated 7 years ago
- KLayoutPhotonicPCells Core Library. Functionallities to extend KLayout PCells for Photonics☆10Jan 10, 2020Updated 6 years ago
- Tor relay nearest neighbour ranking☆10Oct 18, 2021Updated 4 years ago
- Instrospect function signatures to construct a CLI☆16Apr 16, 2021Updated 4 years ago
- Pastenum is a text dump enumeration tool.☆14Dec 9, 2013Updated 12 years ago
- ☆13Feb 18, 2026Updated 2 weeks ago
- Docker Monitor☆10May 14, 2016Updated 9 years ago
- ADK apps for Product Engineers☆28Jan 8, 2026Updated last month
- PoWx mission: Aiming at smaller energy per hash hardware.☆11Dec 18, 2021Updated 4 years ago
- The ultimate tool to crafting your ARM shell code☆10Aug 7, 2015Updated 10 years ago
- OpenTitan: Open source silicon root of trust☆10Feb 5, 2020Updated 6 years ago
- Some tools☆10Dec 5, 2017Updated 8 years ago
- Some notes on the relationship between the Legendre and Fourier transforms☆11Dec 18, 2025Updated 2 months ago
- dotfiles. Installer for key programs, fonts, vim, and zsh☆13Jul 16, 2024Updated last year
- The NowSecure Mobile Security Report☆11Nov 16, 2016Updated 9 years ago
- CVE-2015-2231 POC☆10Sep 8, 2015Updated 10 years ago
- Application Security Vulnerability Periodic Table☆14Aug 25, 2014Updated 11 years ago
- ☆12Mar 15, 2020Updated 5 years ago
- MCP server for Google search and page fetching using headless Chromium☆71Feb 21, 2026Updated 2 weeks ago
- Netwitness Maltego integration Project☆18May 9, 2017Updated 8 years ago
- Core software stack used by the MIT Hyperloop Team during the SpaceX Hyperloop Pod Competition in January 2017.☆10Jan 30, 2026Updated last month
- ☆12Aug 16, 2022Updated 3 years ago
- Themis MapReduce and TritonSort☆11Nov 2, 2017Updated 8 years ago
- Here comes the paintrain!☆11Aug 8, 2016Updated 9 years ago
- An extra light, extra simple Objective-C hooking framework☆16Jun 18, 2025Updated 8 months ago
- C++14 automated code test infrastructure with permutation, fuzzing, sanitising and edge coverage☆12Dec 16, 2025Updated 2 months ago
- This is a collection of Jupyter Notebooks for teaching and outreach purposes☆10Oct 3, 2023Updated 2 years ago
- IDA scripts that facilitate reverse engineering☆16Aug 10, 2016Updated 9 years ago
- pspgen utility on top of DPDK☆14Mar 21, 2016Updated 9 years ago
- Simple tmux launcher that will take less than 2 minutes to learn and should work across all versions of tmux☆13May 16, 2024Updated last year
- Domaintools addon for Maltego☆15Sep 13, 2012Updated 13 years ago
- Brainfuck compiler and interpreter☆17Jun 29, 2023Updated 2 years ago
- Top Disk Usage. You want to know what is using all your disk space ? This command-line tool estimates the disk space occupied by all file…☆17Jul 1, 2023Updated 2 years ago
- ☆10May 17, 2021Updated 4 years ago