smpanaro / ModernBERT-AppleNeuralEngine
ModernBERT model optimized for Apple Neural Engine.
☆27 Updated 9 months ago
Alternatives and similar repositories for ModernBERT-AppleNeuralEngine
Users who are interested in ModernBERT-AppleNeuralEngine are comparing it to the libraries listed below.
- Find out why your CoreML model isn't running on the Neural Engine! ☆26 Updated last year
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX. ☆210 Updated last month
- run embeddings in MLX ☆93 Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆83 Updated last month
- ☆120 Updated 3 months ago
- MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers an… ☆66 Updated 10 months ago
- ☆20 Updated last month
- Profile your CoreML models directly from Python 🐍 ☆28 Updated last month
- mlx image models for Apple Silicon machines ☆85 Updated 5 months ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX ☆57 Updated last year
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework. ☆112 Updated last year
- MLX support for the Open Neural Network Exchange (ONNX) ☆59 Updated last year
- ☆25 Updated 9 months ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video ☆59 Updated last year
- ☆218 Updated 8 months ago
- A miniature version of Modal ☆20 Updated last year
- Fast parallel LLM inference for MLX ☆220 Updated last year
- FlashAttention (Metal Port) ☆538 Updated last year
- C API for MLX ☆137 Updated 2 weeks ago
- look how they massacred my boy ☆63 Updated 11 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆149 Updated 2 months ago
- ☆14 Updated 10 months ago
- MLX implementation of xLSTM model by Beck et al. (2024) ☆28 Updated last year
- Implementation of ModernBERT in MLX ☆19 Updated last month
- 1.58 Bit LLM on Apple Silicon using MLX ☆223 Updated last year
- For inferring and serving local LLMs using the MLX framework ☆109 Updated last year
- A collection of optimizers for MLX ☆53 Updated last week
- Google TPU optimizations for transformers models ☆120 Updated 8 months ago
- Python bindings for ggml ☆146 Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆55 Updated 8 months ago