somewheresystems / llama2mlxLinks
Karpathy's llama2.c transpiled to MLX for Apple Silicon
☆14Updated last year
Alternatives and similar repositories for llama2mlx
Users that are interested in llama2mlx are comparing it to the libraries listed below
Sorting:
- Chat Markup Language conversation library☆55Updated last year
- Project code for training LLMs to write better unit tests + code☆21Updated 5 months ago
- ☆46Updated 2 years ago
- QLoRA for Masked Language Modeling☆22Updated 2 years ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated last week
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 11 months ago
- look how they massacred my boy☆63Updated last year
- ☆40Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆31Updated last year
- Simplex Random Feature attention, in PyTorch☆73Updated 2 years ago
- ☆55Updated 11 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated 2 years ago
- Simple GRPO scripts and configurations.☆59Updated 8 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- utilities for loading and running text embeddings with onnx☆44Updated 2 months ago
- Just a bunch of benchmark logs for different LLMs☆118Updated last year
- smolLM with Entropix sampler on pytorch☆150Updated last year
- various experiments for scaling inference time compute with small reasoning models☆17Updated 9 months ago
- A strongly typed Python DSL for developing message passing multi agent systems☆53Updated last year
- Track the progress of LLM context utilisation☆54Updated 6 months ago
- Verbosity control for AI agents☆65Updated last year
- Cerule - A Tiny Mighty Vision Model☆67Updated last year
- ☆67Updated last year
- mlx implementations of various transformers, speedups, training☆33Updated last year
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆113Updated last year
- ☆63Updated last year
- QLoRA with Enhanced Multi GPU Support☆37Updated 2 years ago
- ☆50Updated 8 months ago
- Latent Large Language Models☆19Updated last year