RyanLucas3 / poasterGPT
A single notebook for fine-tuning GPT-3.5 Turbo
☆31 Updated last year
Alternatives and similar repositories for poasterGPT
Users who are interested in poasterGPT are comparing it to the repositories listed below.
- Simple Transformer in Jax ☆139 Updated last year
- This repository explains and provides examples for "concept anchoring" in GPT4. ☆71 Updated last year
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci… ☆24 Updated 2 years ago
- Let's make sand talk ☆592 Updated 2 years ago
- A collection of LLM services you can self host via docker or modal labs to support your applications development ☆197 Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆82 Updated 2 years ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆181 Updated last week
- papers.day ☆90 Updated last year
- ☆414 Updated 2 years ago
- Vector search over tweets from the tweet archive using OpenAI embeddings and LanceDB ☆58 Updated last year
- A discord bot that roleplays! ☆150 Updated 2 years ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs. ☆314 Updated 4 months ago
- My name is Ozymandias, King of Kings; Look on my Works, ye Mighty, and despair! ☆39 Updated 2 years ago
- Bespoke Automata is a GUI and deployment pipeline for making complex AI agents locally and offline ☆220 Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated last year
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip. ☆44 Updated 2 years ago
- Stream of my favorite papers and links ☆43 Updated last week
- Just a bunch of benchmark logs for different LLMs ☆118 Updated last year
- Some of the scripts I use for scribepod @ https://scribepod.substack.com/, an automated AI podcast ☆172 Updated 2 years ago
- inference code for mixtral-8x7b-32kseqlen ☆102 Updated last year
- Hands-free companionship on demand. ☆76 Updated 2 years ago
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi… ☆351 Updated last year
- Simple embedding -> text model trained on a small subset of Wikipedia sentences. ☆157 Updated 2 years ago
- AI sends pull requests for features you request in natural language ☆112 Updated 2 years ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆62 Updated last year
- ☆134 Updated last year
- Full finetuning of large language models without large memory requirements ☆93 Updated last month
- This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames ☆24 Updated last year
- ☆112 Updated last year
- A puzzle to learn about prompting ☆134 Updated 2 years ago