eniompw / nanoGPTshakespeare
finetuning shakespeare on karpathy/nanoGPT
☆19Updated 2 years ago
Alternatives and similar repositories for nanoGPTshakespeare
Users who are interested in nanoGPTshakespeare are comparing it to the libraries listed below
- ☆57Updated 3 weeks ago
- Testing KAN-based text generation GPT models☆17Updated last year
- aesthetic tensor visualiser☆24Updated 2 months ago
- ☆55Updated 2 weeks ago
- ☆34Updated 4 months ago
- A tutorial for building autonomous agents: with LangChain and from scratch☆28Updated 2 years ago
- This repository contains a better implementation of Kolmogorov-Arnold networks☆62Updated last month
- ☆87Updated last year
- ☆31Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆101Updated 6 months ago
- Very minimal (and stateless) agent framework☆44Updated 6 months ago
- Tools for merging pretrained large language models.☆19Updated last year
- ☆142Updated 7 months ago
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast☆151Updated 10 months ago
- Inference Llama 2 in C++☆43Updated last year
- ☆19Updated last year
- alternative way of calculating self-attention☆18Updated last year
- ☆61Updated last year
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆125Updated last month
- NanoGPT (124M) in 5 minutes☆11Updated 5 months ago
- Simple GRPO scripts and configurations.☆59Updated 5 months ago
- fine tuning mistral 7B using Huggingface, Weights and Biases, Choline, and Vast AI☆38Updated last year
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 8 months ago
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆29Updated last week
- inference code for mixtral-8x7b-32kseqlen☆100Updated last year
- Training and Inference Notebooks for the RedPajama (OpenLlama) models☆18Updated 2 years ago
- This is the code that went into our practical dive using mamba as information extraction☆53Updated last year
- OpenPipe Reinforcement Learning Experiments☆25Updated 4 months ago
- Implementation☆25Updated 3 months ago
- Adaptive Parallel PDF Parsing and Resource Scaling Engine☆48Updated last month