smpanaro / ModernBERT-AppleNeuralEngine
ModernBERT model optimized for Apple Neural Engine.
β24Updated 3 months ago
Alternatives and similar repositories for ModernBERT-AppleNeuralEngine:
Users that are interested in ModernBERT-AppleNeuralEngine are comparing it to the libraries listed below
- Find out why your CoreML model isn't running on the Neural Engine!β25Updated 10 months ago
- Profile your CoreML models directly from Python πβ27Updated 6 months ago
- β17Updated 3 weeks ago
- MLX support for the Open Neural Network Exchange (ONNX)β48Updated last year
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.β137Updated this week
- run embeddings in MLXβ86Updated 6 months ago
- β14Updated 4 months ago
- look how they massacred my boyβ63Updated 6 months ago
- β89Updated 3 weeks ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT videoβ57Updated last year
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub.β13Updated last year
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLXβ53Updated last year
- β26Updated 4 months ago
- NanoGPT-speedrunning for the poor T4 enjoyersβ62Updated this week
- MLX port for xjdr's entropix sampler (mimics jax implementation)β64Updated 5 months ago
- β208Updated 3 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rustβ38Updated last year
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.β96Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optunaβ39Updated 2 months ago
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizersβ92Updated 9 months ago
- Fast parallel LLM inference for MLXβ184Updated 9 months ago
- A collection of optimizers for MLXβ35Updated last month
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.β77Updated 4 months ago
- β49Updated last year
- β48Updated last year
- mlx implementations of various transformers, speedups, trainingβ34Updated last year
- MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers anβ¦β65Updated 5 months ago
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model.β22Updated 10 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training dataβ30Updated 7 months ago
- C API for MLXβ106Updated this week