smpanaro / ModernBERT-AppleNeuralEngineLinks
ModernBERT model optimized for Apple Neural Engine.
☆29Updated last year
Alternatives and similar repositories for ModernBERT-AppleNeuralEngine
Users that are interested in ModernBERT-AppleNeuralEngine are comparing it to the libraries listed below
Sorting:
- run embeddings in MLX☆97Updated last year
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆263Updated 2 weeks ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated 5 months ago
- Find out why your CoreML model isn't running on the Neural Engine!☆30Updated last year
- Start a server from the MLX library.☆196Updated last year
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Updated last year
- Fast parallel LLM inference for MLX☆245Updated last year
- ☆219Updated last year
- MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers an…☆71Updated last year
- MLX support for the Open Neural Network Exchange (ONNX)☆63Updated last year
- Profile your CoreML models directly from Python 🐍☆30Updated 4 months ago
- FlashAttention (Metal Port)☆577Updated last year
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.☆214Updated 3 weeks ago
- mlx image models for Apple Silicon machines☆91Updated 2 months ago
- Implementation of nougat that focuses on processing pdf locally.☆84Updated last year
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆100Updated 7 months ago
- ☆27Updated last year
- For inferring and serving local LLMs using the MLX framework☆110Updated last year
- C API for MLX☆170Updated 3 weeks ago
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.☆119Updated last year
- 1.58 Bit LLM on Apple Silicon using MLX☆242Updated last year
- A collection of optimizers for MLX☆54Updated last month
- ☆15Updated last year
- mlx implementations of various transformers, speedups, training☆33Updated 2 years ago
- Google TPU optimizations for transformers models☆135Updated last week
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆117Updated last year
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆93Updated last week
- ☆68Updated last year
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆59Updated last year
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆63Updated last year