ModernBERT model optimized for Apple Neural Engine.
★31 · Jan 10, 2025 · Updated last year
Alternatives and similar repositories for ModernBERT-AppleNeuralEngine
Users interested in ModernBERT-AppleNeuralEngine are comparing it to the libraries listed below.
- Profile your CoreML models directly from Python ★30 · Sep 8, 2025 · Updated 6 months ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization" ★11 · Mar 31, 2024 · Updated last year
- Convert StableHLO models into Apple Core ML format ★22 · Updated this week
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine. ★126 · Dec 27, 2024 · Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings ★22 · Jun 30, 2025 · Updated 8 months ago
- Run transformers (incl. LLMs) on the Apple Neural Engine. ★62 · Nov 22, 2023 · Updated 2 years ago
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub. ★13 · Mar 16, 2026 · Updated last week
- A minimalistic Swift implementation of the Jinja templating engine, specifically designed for parsing and rendering ML chat templates. ★122 · Feb 19, 2026 · Updated last month
- ★12 · Jan 4, 2024 · Updated 2 years ago
- ★19 · Dec 31, 2025 · Updated 2 months ago
- Swift package for reading and writing Safetensors files. ★12 · Feb 6, 2026 · Updated last month
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model. ★25 · Oct 23, 2025 · Updated 5 months ago
- EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction ★26 · May 22, 2024 · Updated last year
- A RAG that can scale ★11 · May 28, 2024 · Updated last year
- Token De/Serializer for testing De/Serialize implementations ★14 · Dec 17, 2024 · Updated last year
- Truly flash implementation of DeBERTa disentangled attention mechanism. ★81 · Feb 10, 2026 · Updated last month
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba ★37 · Oct 16, 2025 · Updated 5 months ago
- ★35 · Apr 28, 2025 · Updated 10 months ago
- ★18 · Nov 19, 2024 · Updated last year
- ★12 · Jun 27, 2024 · Updated last year
- Rust crate for some audio utilities ★27 · Mar 8, 2025 · Updated last year
- A Swift wrapper for the Supertone text-to-speech model ★34 · Dec 11, 2025 · Updated 3 months ago
- Ukrainian ELECTRA model ★12 · Mar 11, 2023 · Updated 3 years ago
- ANE accelerated embedding models! ★20 · Dec 11, 2024 · Updated last year
- HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ★17 · Mar 20, 2024 · Updated 2 years ago
- ★20 · Apr 23, 2025 · Updated 11 months ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems ★35 · Nov 21, 2025 · Updated 4 months ago
- Semantically Search Emojis From the Command Line! ★13 · Nov 26, 2023 · Updated 2 years ago
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware i… ★29 · Mar 8, 2026 · Updated 2 weeks ago
- ★91 · Feb 29, 2024 · Updated 2 years ago
- Swift Core ML Examples ★258 · Nov 28, 2024 · Updated last year
- ★16 · Apr 30, 2025 · Updated 10 months ago
- ★13 · Nov 27, 2025 · Updated 3 months ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals … ★15 · Jul 19, 2024 · Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ★35 · Aug 21, 2025 · Updated 7 months ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video ★63 · Apr 14, 2024 · Updated last year
- ★15 · Dec 4, 2024 · Updated last year
- FFT based 2D cross-correlation on OSX/iOS using Accelerate framework ★10 · Jan 5, 2016 · Updated 10 years ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies ★21 · Oct 24, 2022 · Updated 3 years ago