mzbac / mlx_sharding
Distributed Inference for mlx LLm
☆70Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for mlx_sharding
- ☆104Updated 8 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆55Updated last week
- ☆149Updated 4 months ago
- ☆38Updated 8 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆55Updated 2 weeks ago
- Scripts to create your own moe models using mlx☆86Updated 8 months ago
- ☆64Updated 5 months ago
- ☆112Updated this week
- ☆118Updated 3 months ago
- Fast parallel LLM inference for MLX☆149Updated 4 months ago
- Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of…☆47Updated last month
- Simple examples using Argilla tools to build AI☆40Updated this week
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆221Updated 6 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆113Updated 3 weeks ago
- 1.58 Bit LLM on Apple Silicon using MLX☆146Updated 6 months ago
- Official homepage for "Self-Harmonized Chain of Thought"☆83Updated 2 months ago
- A toolkit for building multimodal AI agents☆111Updated this week
- Routing on Random Forest (RoRF)☆84Updated last month
- ☆58Updated this week
- AnyModal is a Flexible Multimodal Language Model Framework☆40Updated this week
- prime (previously called ZeroBand) is a framework for efficient, globally distributed training of AI models over the internet.☆212Updated this week
- look how they massacred my boy☆58Updated last month
- Easily view and modify JSON datasets for large language models☆62Updated last month
- For inferring and serving local LLMs using the MLX framework☆89Updated 7 months ago
- Very basic framework for parameterized large language model (Q)LoRa fine-tuning using mlx, mlx_lm, and OgbujiPT. Architecture for system…☆35Updated last week
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆57Updated 4 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- inference code for mixtral-8x7b-32kseqlen☆98Updated 11 months ago
- All the world is a play, we are but actors in it.☆47Updated 4 months ago
- Something similar to Apple Intelligence?☆57Updated 4 months ago