yacineMTB / just-large-modelsLinks

Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.

☆44

Alternatives and similar repositories for just-large-models

Users that are interested in just-large-models are comparing it to the libraries listed below

Sorting:

xjdr-alt / simple_transformer
Simple Transformer in Jax
☆139Updated last year
teknium1 / transformers-gptq-quant
☆45Updated 2 years ago
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆83Updated 2 years ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated last year
NousResearch / StripedHyenaTrainer
☆62Updated last year
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 11 months ago
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated 2 months ago
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32Updated last month
SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆149Updated last year
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆103Updated last year
notarussianteenager / srf-attention
Simplex Random Feature attention, in PyTorch
☆75Updated 2 years ago
Nearcyan / papers.day
papers.day
☆91Updated last year
joey00072 / Tinytorch
A really tiny autograd engine
☆96Updated 6 months ago
MF-FOOM / wikivec2text
Simple embedding -> text model trained on a small subset of Wikipedia sentences.
☆158Updated 2 years ago
ishan0102 / rsrch.space
Stream of my favorite papers and links
☆44Updated last week
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
okarthikb / state-space-models
☆28Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆182Updated last month
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆68Updated 3 weeks ago
hitorilabs / navi
compute, storage, and networking infra at home
☆65Updated last year
xjdr-alt / muzero_sketch
☆40Updated last year
abacaj / train-with-fsdp
☆94Updated 2 years ago
CarperAI / treasure_trove
☆22Updated 2 years ago
euclaise / supertrainer2000
☆50Updated last year
thesephist / spectre
Sparse autoencoders for Contra text embedding models
☆25Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆108Updated 8 months ago
knowrohit / know_medical_dialogues
KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…
☆24Updated 2 years ago
teknium1 / RawTransform
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆30Updated 2 years ago
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58Updated last year