eniompw / nanoGPTshakespeare
finetuning shakespeare on karpathy/nanoGPT
☆22 · Updated 2 years ago
Alternatives and similar repositories for nanoGPTshakespeare
Users that are interested in nanoGPTshakespeare are comparing it to the libraries listed below
- ☆36 · Updated last year
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in… ☆25 · Updated 5 months ago
- ☆36 · Updated 3 months ago
- inference code for mixtral-8x7b-32kseqlen ☆102 · Updated last year
- A repository of Python scripts to scrape code contents of the public repositories of `huggingface`. ☆52 · Updated last year
- A repository of projects and datasets under active development by Alignment Lab AI ☆22 · Updated last year
- ☆57 · Updated 4 months ago
- ☆61 · Updated last year
- a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model ☆164 · Updated last year
- ☆55 · Updated 2 months ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ☆169 · Updated last year
- ☆88 · Updated last year
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆33 · Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat ☆99 · Updated 2 years ago
- Learn the building blocks of how to build gpt-oss from scratch ☆97 · Updated last month
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework. ☆113 · Updated last year
- NanoGPT (124M) in 5 minutes ☆13 · Updated 8 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆132 · Updated 2 weeks ago
- Using multiple LLMs for ensemble Forecasting ☆16 · Updated last year
- An introduction to LLM Sampling ☆79 · Updated 10 months ago
- ☆46 · Updated 2 years ago
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub ☆160 · Updated 2 years ago
- ☆138 · Updated last year
- This repository contains a better implementation of Kolmogorov-Arnold networks ☆63 · Updated 5 months ago
- ☆87 · Updated last year
- Inference of Mamba models in pure C ☆192 · Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆69 · Updated 2 years ago
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast ☆150 · Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 · Updated last year
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their… ☆155 · Updated 3 weeks ago