nath1295 / MLX-Textgen
A Python package for serving LLMs through OpenAI-compatible API endpoints with prompt caching, using MLX.
☆52 Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for MLX-Textgen
- ☆38 Updated 8 months ago
- Routing on Random Forest (RoRF) ☆83 Updated last month
- Scripts to create your own MoE models using MLX ☆86 Updated 8 months ago
- Very basic framework for parameterized large language model (Q)LoRA fine-tuning using mlx, mlx_lm, and OgbujiPT. Architecture for system… ☆34 Updated this week
- ☆104 Updated 7 months ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX. ☆75 Updated 3 weeks ago
- Distributed inference for MLX LLMs ☆69 Updated 3 months ago
- Fast parallel LLM inference for MLX