Blaizzy / Coding-LLMs-from-scratchLinks

☆32

Alternatives and similar repositories for Coding-LLMs-from-scratch

Users that are interested in Coding-LLMs-from-scratch are comparing it to the libraries listed below

Sorting:

evintunador / minGemma
a simplified version of Google's Gemma model to be used for learning
☆26Updated last year
QuixiAI / grokadamw
☆134Updated 11 months ago
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆100Updated last year
keeeeenw / MicroLlama
Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget
☆153Updated this week
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆232Updated 8 months ago
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆174Updated last year
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66Updated 10 months ago
teknium1 / ShareGPT-Builder
☆115Updated 7 months ago
nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆69Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated last year
l4b4r4b4b4 / AIDocks
LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT
☆27Updated last year
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year
huggingface / discord-bots
☆50Updated last year
taprosoft / llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…
☆146Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103Updated 4 months ago
NousResearch / Obsidian
Maybe the new state of the art vision model? we'll see 🤷‍♂️
☆166Updated last year
geronimi73 / phi2-finetune
☆87Updated last year
sanchit-gandhi / notebooks
A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).
☆45Updated 11 months ago
interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆37Updated last year
QuixiAI / kraken
☆66Updated last year
TrelisResearch / install-guides
Various installation guides for Large Language Models
☆71Updated 3 months ago
TheBlokeAI / AIScripts
Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub
☆162Updated last year
geronimi73 / qlora-minimal
☆86Updated last year
rasbt / dora-from-scratch
LoRA and DoRA from Scratch Implementations
☆206Updated last year
mzbac / mlx_sharding
Distributed Inference for mlx LLm
☆94Updated 11 months ago
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆106Updated last year
premAI-io / benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
☆137Updated last year
rmihaylov / mpttune
Tune MPTs
☆84Updated 2 years ago
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆82Updated 2 months ago
vithursant / nanoGPT_mlx
Port of Andrej Karpathy's nanoGPT to Apple MLX framework.
☆110Updated last year