pranavjad / mlx-gpt2Links
gpt-2 from scratch in mlx
☆389Updated last year
Alternatives and similar repositories for mlx-gpt2
Users that are interested in mlx-gpt2 are comparing it to the libraries listed below
Sorting:
- FastMLX is a high performance production ready API to host MLX models.☆308Updated 3 months ago
- ☆411Updated 10 months ago
- Fast parallel LLM inference for MLX☆193Updated 11 months ago
- An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning☆168Updated last year
- A reinforcement learning framework based on MLX.☆233Updated 4 months ago
- port of Andrjey Karpathy's llm.c to Mojo☆352Updated 6 months ago
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆273Updated last week
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.☆171Updated last year
- Start a server from the MLX library.☆187Updated 10 months ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆173Updated 10 months ago
- ☆176Updated 3 months ago
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆268Updated 9 months ago
- Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.☆446Updated 4 months ago
- LLM Analytics☆668Updated 8 months ago
- "Deep Dive into AI with MLX and PyTorch" is an educational initiative designed to help anyone interested in AI, specifically in machine l…☆476Updated last month
- a small code base for training large models☆301Updated last month
- Official inference library for pre-processing of Mistral models☆747Updated this week
- run embeddings in MLX☆90Updated 8 months ago
- The n-gram Language Model☆1,428Updated 10 months ago
- The Multilayer Perceptron Language Model☆554Updated 10 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆271Updated 7 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 7 months ago
- a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model☆145Updated last year
- ☆152Updated 6 months ago
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX.☆730Updated last year
- The Tensor (or Array)☆436Updated 10 months ago
- ☆165Updated last year
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆108Updated last year
- A simple UI / Web / Frontend for MLX mlx-lm using Streamlit.☆256Updated last week
- Fast bare-bones BPE for modern tokenizer training☆159Updated 2 months ago