mzbac / mlx-moe
Scripts to create your own MoE models using MLX
☆89 Updated last year
Alternatives and similar repositories for mlx-moe:
Users who are interested in mlx-moe are comparing it to the libraries listed below.
- ☆112 Updated 4 months ago
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)DoRA fine-tuning using mlx, mlx_lm, and OgbujiPT. ☆40 Updated 2 months ago
- ☆66 Updated 11 months ago
- Fast approximate inference on a single GPU with sparsity-aware offloading ☆38 Updated last year
- Fast parallel LLM inference for MLX ☆186 Updated 10 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆223 Updated last year
- ☆28 Updated last year
- An implementation of Self-Extend, to expand the context window via grouped attention ☆119 Updated last year
- tiny_fnc_engine is a minimal Python library that provides a flexible engine for calling functions extracted from an LLM. ☆38 Updated 7 months ago
- All the world is a play, we are but actors in it. ☆49 Updated this week
- ☆38 Updated last year
- For inferring and serving local LLMs using the MLX framework ☆103 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆119 Updated 9 months ago
- ☆154 Updated 9 months ago
- ☆73 Updated last year
- Port of Suno's Bark TTS transformer in Apple's MLX Framework ☆80 Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours ☆60 Updated 8 months ago
- The one who calls upon functions - Function-Calling Language Model ☆36 Updated last year
- Full finetuning of large language models without large memory requirements ☆94 Updated last year
- Open-source AI for voice control, rivaling Alexa and Siri ☆12 Updated last year
- Chat Markup Language conversation library ☆55 Updated last year
- Gradio-based tool to run open-source LLM models directly from Hugging Face ☆91 Updated 10 months ago
- Implementation of nougat that focuses on processing PDFs locally. ☆81 Updated 3 months ago
- Auto fine-tuning of models with synthetic data ☆75 Updated last year
- Video+code lecture on building nanoGPT from scratch ☆66 Updated 10 months ago
- Inference code for mixtral-8x7b-32kseqlen ☆99 Updated last year
- ☆85 Updated 7 months ago
- Let's create synthetic textbooks together :) ☆74 Updated last year
- Minimal, clean code implementation of RAG with mlx using gguf model weights ☆49 Updated last year
- Hugging Face chat-ui integration with mlx-lm server ☆60 Updated last year