pranavjad / mlx-gpt2Links
gpt-2 from scratch in mlx
☆397Updated last year
Alternatives and similar repositories for mlx-gpt2
Users that are interested in mlx-gpt2 are comparing it to the libraries listed below
Sorting:
- ☆413Updated last year
- Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.☆455Updated 8 months ago
- Fast parallel LLM inference for MLX☆220Updated last year
- ☆447Updated last year
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning☆170Updated last year
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.☆175Updated last year
- FastMLX is a high performance production ready API to host MLX models.☆331Updated 6 months ago
- A reinforcement learning framework based on MLX.☆240Updated 3 weeks ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆450Updated last year
- LLM Analytics☆686Updated 11 months ago
- Start a server from the MLX library.☆192Updated last year
- ☆96Updated last year
- A simple UI / Web / Frontend for MLX mlx-lm using Streamlit.☆261Updated 3 months ago
- port of Andrjey Karpathy's llm.c to Mojo☆357Updated 2 months ago
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆272Updated last year
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆280Updated 3 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆83Updated last month
- (WIP) A small but powerful, homemade PyTorch from scratch.☆638Updated 3 weeks ago
- a small code base for training large models☆308Updated 5 months ago
- smol models are fun too☆93Updated 10 months ago
- The Multilayer Perceptron Language Model☆568Updated last year
- Visualize the intermediate output of Mistral 7B☆371Updated 8 months ago
- ☆159Updated 10 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆816Updated 2 months ago
- A comprehensive deep dive into the world of tokens☆226Updated last year
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆625Updated 6 months ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆915Updated 5 months ago
- Simple Transformer in Jax☆139Updated last year
- 1.58 Bit LLM on Apple Silicon using MLX☆223Updated last year