smpanaro / ModernBERT-AppleNeuralEngineLinks
ModernBERT model optimized for Apple Neural Engine.
☆29Updated 11 months ago
Alternatives and similar repositories for ModernBERT-AppleNeuralEngine
Users that are interested in ModernBERT-AppleNeuralEngine are comparing it to the libraries listed below
Sorting:
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆84Updated 4 months ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆236Updated last month
- Fast parallel LLM inference for MLX☆235Updated last year
- run embeddings in MLX☆96Updated last year
- Find out why your CoreML model isn't running on the Neural Engine!☆28Updated last year
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Updated last year
- MLX implementation of xLSTM model by Beck et al. (2024)☆29Updated last year
- MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers an…☆67Updated last year
- MoE training for Me and You and maybe other people☆239Updated this week
- mlx image models for Apple Silicon machines☆88Updated 2 weeks ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆59Updated last year
- smol models are fun too☆92Updated last year
- FlashAttention (Metal Port)☆567Updated last year
- Start a server from the MLX library.☆195Updated last year
- C API for MLX☆155Updated this week
- look how they massacred my boy☆63Updated last year
- ☆219Updated 10 months ago
- smolLM with Entropix sampler on pytorch☆149Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Updated 7 months ago
- For inferring and serving local LLMs using the MLX framework☆109Updated last year
- 1.58 Bit LLM on Apple Silicon using MLX☆227Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Updated 10 months ago
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.☆206Updated 6 months ago
- MLX support for the Open Neural Network Exchange (ONNX)☆63Updated last year
- A collection of optimizers for MLX☆54Updated last week
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆99Updated 5 months ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆116Updated last year
- mlx implementations of various transformers, speedups, training☆33Updated 2 years ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆154Updated 5 months ago
- ☆20Updated 3 months ago