eniompw / nanoGPTshakespeare

finetuning shakespeare on karpathy/nanoGPT

☆19

Alternatives and similar repositories for nanoGPTshakespeare

Users who are interested in nanoGPTshakespeare are comparing it to the repositories listed below.

wandb / programmer
☆57 Updated 3 weeks ago
kabachuha / nanoGPKANT
Testing KAN-based text generation GPT models
☆17 Updated last year
attentionmech / tensorlens
aesthetic tensor visualiser
☆24 Updated 2 months ago
lamm-mit / LifeGPT
☆55 Updated 2 weeks ago
axolotl-ai-cloud / axolotl-cookbook
☆34 Updated 4 months ago
Troyanovsky / autonomous_agent_tutorial
A tutorial for building autonomous agents: with LangChain and from scratch
☆28 Updated 2 years ago
sidhu2690 / Deep-KAN
This repository contains a better implementation of Kolmogorov-Arnold networks
☆62 Updated last month
geronimi73 / phi2-finetune
☆87 Updated last year
geronimi73 / mamba
☆31 Updated last year
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆101 Updated 6 months ago
AtakanTekparmak / agento
Very minimal (and stateless) agent framework
☆44 Updated 6 months ago
v-prgmr / mergekit
Tools for merging pretrained large language models.
☆19 Updated last year
Jellyfish042 / Sudoku-RWKV
☆142 Updated 7 months ago
kyegomez / Andromeda
An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast
☆151 Updated 10 months ago
AmeyaWagh / llama2.cpp
Inference Llama 2 in C++
☆43 Updated last year
GraphIndex-org / semantic-mapper
☆19 Updated last year
joey00072 / Attention-as-graph
An alternative way of calculating self-attention
☆18 Updated last year
NousResearch / StripedHyenaTrainer
☆61 Updated last year
kyegomez / swarms-pytorch
Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊
☆125 Updated last month
alexjc / nanogpt-speedrun
NanoGPT (124M) in 5 minutes
☆11 Updated 5 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59 Updated 5 months ago
bdytx5 / mistral7B_finetune
fine tuning mistral 7B using Huggingface, Weights and Biases, Choline, and Vast AI
☆38 Updated last year
iantbutler01 / ditty
A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.
☆16 Updated 8 months ago
kyegomez / OpenStrawberry
An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO
☆29 Updated last week
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆100 Updated last year
johnrobinsn / redpajama
Training and Inference Notebooks for the RedPajama (OpenLlama) models
☆18 Updated 2 years ago
Oxen-AI / mamba-dive
This is the code that went into our practical dive on using Mamba for information extraction
☆53 Updated last year
OpenPipe / rl-experiments
OpenPipe Reinforcement Learning Experiments
☆25 Updated 4 months ago
gregorycoppola / bayes-star
Implementation
☆25 Updated 3 months ago
7shoe / AdaParse
Adaptive Parallel PDF Parsing and Resource Scaling Engine
☆48 Updated last month