NolanoOrg / smol-gpt
Smol but mighty language model
☆63Updated 2 years ago
Alternatives and similar repositories for smol-gpt:
Users that are interested in smol-gpt are comparing it to the libraries listed below
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- Multi-Domain Expert Learning☆67Updated last year
- Drop in replacement for OpenAI, but with Open models.☆153Updated last year
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆101Updated last year
- ☆46Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Full finetuning of large language models without large memory requirements☆93Updated last year
- ☆48Updated last year
- ☆22Updated last year
- ☆26Updated 2 years ago
- Maybe the new state of the art vision model? we'll see 🤷♂️☆161Updated last year
- ☆93Updated 3 months ago
- Hands-free companionship on demand.☆77Updated 2 years ago
- Drive a browser with Cohere☆72Updated 2 years ago
- Experiments with generating opensource language model assistants☆97Updated last year
- Command-line script for inferencing from models such as falcon-7b-instruct☆76Updated last year
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆102Updated 8 months ago
- A library for squeakily cleaning and filtering language datasets.☆46Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- ☆24Updated last year
- Instruct-tuning LLaMA on consumer hardware☆66Updated 2 years ago
- [Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆47Updated 2 years ago
- ☆143Updated 2 years ago
- Run, build, test transformer models using docker☆32Updated last year
- The first AI artist☆32Updated 2 years ago
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆153Updated last year
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search☆40Updated last year
- Fast inference of Instruct tuned LLaMa on your personal devices.☆22Updated 2 years ago