smpanaro / ModernBERT-AppleNeuralEngineLinks
ModernBERT model optimized for Apple Neural Engine.
☆28Updated 9 months ago
Alternatives and similar repositories for ModernBERT-AppleNeuralEngine
Users that are interested in ModernBERT-AppleNeuralEngine are comparing it to the libraries listed below
Sorting:
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆218Updated this week
 - Find out why your CoreML model isn't running on the Neural Engine!☆27Updated last year
 - run embeddings in MLX☆94Updated last year
 - A simple MLX implementation for pretraining LLMs on Apple Silicon.☆84Updated 2 months ago
 - Fast parallel LLM inference for MLX☆224Updated last year
 - ☆20Updated 2 months ago
 - ☆218Updated 9 months ago
 - mlx implementations of various transformers, speedups, training☆33Updated last year
 - Start a server from the MLX library.☆192Updated last year
 - MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers an…☆67Updated 11 months ago
 - Profile your CoreML models directly from Python 🐍☆29Updated last month
 - FlashAttention (Metal Port)☆546Updated last year
 - MLX implementation of xLSTM model by Beck et al. (2024)☆29Updated last year
 - ☆122Updated 4 months ago
 - CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.☆117Updated 10 months ago
 - C API for MLX☆145Updated last month
 - ☆25Updated 10 months ago
 - 1.58 Bit LLM on Apple Silicon using MLX☆225Updated last year
 - For inferring and serving local LLMs using the MLX framework☆109Updated last year
 - ☆14Updated 10 months ago
 - ☆46Updated 2 years ago
 - The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆151Updated 3 months ago
 - TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆59Updated 6 months ago
 - Google TPU optimizations for transformers models☆121Updated 9 months ago
 - MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated 11 months ago
 - Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.☆457Updated 9 months ago
 - look how they massacred my boy☆63Updated last year
 - A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆57Updated last year
 - smolLM with Entropix sampler on pytorch☆150Updated last year
 - A collection of optimizers for MLX☆53Updated last week