RobertRiachi / nanoPALMLinks

☆144

Alternatives and similar repositories for nanoPALM

Users that are interested in nanoPALM are comparing it to the libraries listed below

Sorting:

geov-ai / geov
The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆121Updated 2 years ago
tysam-code / hlb-gpt
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…
☆350Updated last year
abacaj / train-with-fsdp
☆94Updated 2 years ago
Narsil / fast_gpt2
☆157Updated 2 years ago
MF-FOOM / wikivec2text
Simple embedding -> text model trained on a small subset of Wikipedia sentences.
☆156Updated 2 years ago
CarperAI / treasure_trove
☆22Updated 2 years ago
srush / raspy
An interactive exploration of Transformer programming.
☆269Updated last year
Sentdex / Lambda-Cloud
Helpers and such for working with Lambda Cloud
☆51Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆93Updated last month
notarussianteenager / srf-attention
Simplex Random Feature attention, in PyTorch
☆73Updated 2 years ago
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆99Updated 2 years ago
aidangomez / weblm
Drive a browser with Cohere
☆71Updated 2 years ago
jxbz / agd
Automatic gradient descent
☆215Updated 2 years ago
hundredblocks / large-model-parallelism
Functional local implementations of main model parallelism approaches
☆96Updated 2 years ago
abacaj / transformers
Understanding large language models
☆119Updated 2 years ago
r-three / git-theta
git extension for {collaborative, communal, continual} model development
☆215Updated 11 months ago
keerthanpg / SwePT
AI sends pull requests for features you request in natural language
☆112Updated 2 years ago
srush / GPTWorld
A puzzle to learn about prompting
☆135Updated 2 years ago
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Updated 2 years ago
sytelus / pcprep
Various handy scripts to quickly setup new Linux and Windows sandboxes, containers and WSL.
☆40Updated this week
teknium1 / transformers-gptq-quant
☆46Updated 2 years ago
Cerebras / gigaGPT
a small code base for training large models
☆309Updated 5 months ago
keerthanpg / TalkToCode
☆166Updated 2 years ago
mobarski / alpaca-libre
Reimplementation of the task generation part from the Alpaca paper
☆118Updated 2 years ago
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆139Updated last year
NolanoOrg / smol-gpt
Smol but mighty language model
☆62Updated 2 years ago
CarperAI / cheese
Used for adaptive human in the loop evaluation of language and embedding models.
☆307Updated 2 years ago
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆181Updated last week
joey00072 / Tinytorch
A really tiny autograd engine
☆95Updated 4 months ago
neoneye / ARC-Interactive-History-Dataset
The history files when recording human interaction while solving ARC tasks
☆117Updated last week