exo-explore / mlx-bitnet
1.58 Bit LLM on Apple Silicon using MLX
☆214 Updated last year
Alternatives and similar repositories for mlx-bitnet
Users who are interested in mlx-bitnet are comparing it to the libraries listed below.
- Fast parallel LLM inference for MLX ☆193 Updated 11 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs ☆85 Updated 11 months ago
- Train Large Language Models on MLX. ☆94 Updated this week
- Inference of Mamba models in pure C ☆187 Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆80 Updated last month
- Distributed Inference for MLX LLM ☆93 Updated 10 months ago
- FastMLX is a high performance production ready API to host MLX models. ☆308 Updated 3 months ago
- Train your own small bitnet model ☆72 Updated 8 months ago
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon ☆268 Updated 9 months ago
- ☆157 Updated 11 months ago
- LLM inference in C/C++ ☆77 Updated this week
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework. ☆108 Updated last year
- 1.58-bit LLaMa model ☆81 Updated last year
- ☆213 Updated 5 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆198 Updated 11 months ago
- Run embeddings in MLX ☆90 Updated 8 months ago
- Scripts to create your own MoE models using MLX ☆90 Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆64 Updated 7 months ago
- ☆132 Updated 10 months ago
- ☆114 Updated 6 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of OpenAI and Claude 3.7) ☆67 Updated 3 months ago
- An efficient implementation of the method proposed in "The Era of 1-bit LLMs" ☆154 Updated 8 months ago
- An open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆101 Updated 3 months ago
- Implementation of nougat that focuses on processing pdf locally. ☆81 Updated 5 months ago
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework. ☆273 Updated last week
- MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers an… ☆66 Updated 7 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates ☆156 Updated last month
- An implementation of bucketMul LLM inference ☆217 Updated 11 months ago
- ☆176 Updated 3 months ago
- Start a server from the MLX library. ☆187 Updated 10 months ago