smpanaro / ModernBERT-AppleNeuralEngine
ModernBERT model optimized for Apple Neural Engine.
☆23Updated 3 weeks ago
Alternatives and similar repositories for ModernBERT-AppleNeuralEngine:
Users that are interested in ModernBERT-AppleNeuralEngine are comparing it to the libraries listed below
- Find out why your CoreML model isn't running on the Neural Engine!☆24Updated 7 months ago
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.☆78Updated last month
- ☆14Updated last month
- Profile your CoreML models directly from Python 🐍☆26Updated 3 months ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆93Updated 3 months ago
- C API for MLX☆91Updated this week
- mlx image models for Apple Silicon machines☆70Updated 2 months ago
- MLX support for the Open Neural Network Exchange (ONNX)☆43Updated 11 months ago
- look how they massacred my boy☆63Updated 3 months ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆56Updated 9 months ago
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model.☆21Updated 8 months ago
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub.☆12Updated last year
- ☆21Updated 7 months ago
- run embeddings in MLX☆81Updated 4 months ago
- Google TPU optimizations for transformers models☆90Updated last week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆119Updated last month
- PageRank for LLMs☆35Updated this week
- Fast parallel LLM inference for MLX☆153Updated 6 months ago
- MLX implementation of xLSTM model by Beck et al. (2024)☆26Updated 7 months ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆104Updated 11 months ago
- Experiments for efforts to train a new and improved t5☆77Updated 9 months ago
- Training code for Sparse Autoencoders on Embedding models☆35Updated 2 months ago
- MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers an…☆58Updated 2 months ago
- ☆48Updated last year
- Structured generation in Rust☆173Updated this week
- A collection of optimizers for MLX☆29Updated this week
- ☆192Updated last week
- An introduction to LLM Sampling☆75Updated last month
- Chat Markup Language conversation library☆55Updated last year
- Super-fast Structured Outputs☆79Updated this week