vithursant / nanoGPT_mlxLinks

Port of Andrej Karpathy's nanoGPT to Apple MLX framework.

☆116

Alternatives and similar repositories for nanoGPT_mlx

Users that are interested in nanoGPT_mlx are comparing it to the libraries listed below

Sorting:

N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆84Updated 3 months ago
andrew-silva / mlx-rlhf
An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.
☆33Updated last year
teknium1 / transformers-gptq-quant
☆45Updated 2 years ago
ToluClassics / mlx-transformers
MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers an…
☆67Updated last year
geronimi73 / phi2-finetune
☆86Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
taylorai / mlx_embedding_models
run embeddings in MLX
☆96Updated last year
xjdr-alt / llmri
look how they massacred my boy
☆63Updated last year
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 10 months ago
SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆149Updated last year
LucasSte / MLX-vs-Pytorch
Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs
☆91Updated last year
mzbac / mlx-moe
Scripts to create your own moe models using mlx
☆90Updated last year
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆59Updated last month
willccbb / mlx_parallm
Fast parallel LLM inference for MLX
☆234Updated last year
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆103Updated last year
vegaluisjose / mlx-rag
Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.
☆179Updated last year
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 11 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆108Updated 8 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆107Updated 8 months ago
brendanhogan / picoDeepResearch
☆68Updated 6 months ago
stockeh / mlx-optimizers
A collection of optimizers for MLX
☆54Updated 2 weeks ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated 2 months ago
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆83Updated 2 years ago
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆62Updated last year
doomslide / hyperobject
Plotting (entropy, varentropy) for small LMs
☆99Updated 6 months ago
teknium1 / ShareGPT-Builder
☆117Updated 11 months ago
ai8hyf / OpenResearchAssistant
An automated tool for discovering insights from research papaer corpora
☆137Updated last year
Jaykef / mlx-rag-gguf
Minimal, clean code implementation of RAG with mlx using gguf model weights
☆53Updated last year
N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.
☆76Updated 9 months ago