eniompw / nanoGPTshakespeare
finetuning shakespeare on karpathy/nanoGPT
☆17Updated 2 years ago
Alternatives and similar repositories for nanoGPTshakespeare:
Users that are interested in nanoGPTshakespeare are comparing it to the libraries listed below
- Testing KAN-based text generation GPT models☆15Updated 9 months ago
- ☆48Updated last month
- Build Agentic workflows with function calling using open LLMs☆26Updated 2 weeks ago
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- A tutorial for building autonomous agents: with LangChain and from scratch☆24Updated last year
- Demos of some issues with LangChain.☆31Updated last year
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 3 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated 2 weeks ago
- ☆52Updated last week
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- ☆34Updated last year
- My solutions for Advanced Python Mastery (course by @dabeaz)☆11Updated last year
- ☆16Updated last year
- A high throughput, end-to-end RL library for infinite horizon tasks.☆18Updated 8 months ago
- ☆60Updated last year
- Finetuning BLOOM on a single GPU using gradient-accumulation☆27Updated last year
- alternative way to calculating self attention☆18Updated 8 months ago
- tinygrad port of the RWKV large language model.☆44Updated 8 months ago
- Because it's there.☆14Updated 5 months ago
- Latent Large Language Models☆17Updated 5 months ago
- Github repo for Peifeng's internship project☆13Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- ML/DL Math and Method notes☆58Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated 3 months ago
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- a simplified version of Google's Gemma model to be used for learning☆24Updated 11 months ago
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!☆42Updated last year