ashworks1706 / rlhf-from-scratchView external linksLinks
A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and it’s applications in Large Language Models from scratch.
☆90Nov 7, 2025Updated 3 months ago
Alternatives and similar repositories for rlhf-from-scratch
Users that are interested in rlhf-from-scratch are comparing it to the libraries listed below
Sorting:
- Parses BGP/AS data from multiple different sources☆11Dec 4, 2021Updated 4 years ago
- Angstrom repository with updated layers file☆11Jul 9, 2021Updated 4 years ago
- DasherA is a Data General DASHER D200/D210 terminal emulator☆37May 20, 2024Updated last year
- Bash script to download all lecture videos & notes for a particular course on Coursera.org.☆13Dec 25, 2015Updated 10 years ago
- ☆11Jul 27, 2016Updated 9 years ago
- ☆12Jun 10, 2023Updated 2 years ago
- This repository contains the code of the Rasa workshop at PyData NYC 2018☆12Oct 19, 2018Updated 7 years ago
- High-level overview on exam topics.☆13Jan 24, 2026Updated 3 weeks ago
- Create microVMs from OCI images☆35Updated this week
- Instrospect function signatures to construct a CLI☆16Apr 16, 2021Updated 4 years ago
- OpenTitan: Open source silicon root of trust☆10Feb 5, 2020Updated 6 years ago
- Themis MapReduce and TritonSort☆11Nov 2, 2017Updated 8 years ago
- KLayoutPhotonicPCells Core Library. Functionallities to extend KLayout PCells for Photonics☆10Jan 10, 2020Updated 6 years ago
- Simple standalone Docker Plugin implementation to demonstrate Clear Containers with VPP.☆13Jan 6, 2023Updated 3 years ago
- PoWx mission: Aiming at smaller energy per hash hardware.☆11Dec 18, 2021Updated 4 years ago
- ☆13Aug 12, 2025Updated 6 months ago
- Here comes the paintrain!☆11Aug 8, 2016Updated 9 years ago
- Application Security Vulnerability Periodic Table☆14Aug 25, 2014Updated 11 years ago
- O'Reilly Course, In-Memory Computing Essentials☆10Oct 16, 2020Updated 5 years ago
- This is a collection of Jupyter Notebooks for teaching and outreach purposes☆10Oct 3, 2023Updated 2 years ago
- Netwitness Maltego integration Project☆18May 9, 2017Updated 8 years ago
- An open hardware reference design based on Mitochondrik-LV.☆13Aug 21, 2024Updated last year
- Stan Jupyter Magic☆11Oct 6, 2018Updated 7 years ago
- Experiments with syntax and symbols☆15Mar 5, 2019Updated 6 years ago
- The NowSecure Mobile Security Report☆11Nov 16, 2016Updated 9 years ago
- An end-to-end management tool for very large computing environments, physical and virtual. it collects alarms and detailed statistics fro…☆16Apr 7, 2023Updated 2 years ago
- Core software stack used by the MIT Hyperloop Team during the SpaceX Hyperloop Pod Competition in January 2017.☆10Jan 30, 2026Updated 2 weeks ago
- ☆32Jan 25, 2026Updated 3 weeks ago
- A thin HTTP API wrapper over rrdtool.☆13May 1, 2023Updated 2 years ago
- ☆10Dec 22, 2016Updated 9 years ago
- the full stack☆13Jun 16, 2015Updated 10 years ago
- Tools to work with vulnerability standards.☆19Mar 19, 2014Updated 11 years ago
- Passive subdomains and web directories recon using Bing.☆13Apr 30, 2018Updated 7 years ago
- 🌐 My personal website made with Hugo in Go.☆12Jan 26, 2026Updated 2 weeks ago
- An evolving hacking framework written in python☆11Jan 11, 2015Updated 11 years ago
- ☆12Oct 28, 2015Updated 10 years ago
- ☆13Jan 11, 2023Updated 3 years ago
- Linux and Windows Hardening Points☆12Mar 6, 2018Updated 7 years ago
- pspgen utility on top of DPDK☆14Mar 21, 2016Updated 9 years ago