exo-explore / mlx-bitnet
1.58 Bit LLM on Apple Silicon using MLX
☆146Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for mlx-bitnet
- Fast parallel LLM inference for MLX☆149Updated 4 months ago
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆226Updated this week
- Distributed Inference for mlx LLm☆70Updated 3 months ago
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆237Updated 2 months ago
- ☆149Updated 4 months ago
- FastMLX is a high performance production ready API to host MLX models.☆218Updated 3 weeks ago
- Scripts to create your own moe models using mlx☆86Updated 8 months ago
- run embeddings in MLX☆73Updated last month
- 1.58-bit LLaMa model☆79Updated 7 months ago
- ☆118Updated 3 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆55Updated last week
- prime (previously called ZeroBand) is a framework for efficient, globally distributed training of AI models over the internet.☆212Updated this week
- ☆104Updated 8 months ago
- Video+code lecture on building nanoGPT from scratch☆64Updated 5 months ago
- inference code for mixtral-8x7b-32kseqlen☆98Updated 11 months ago
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.☆145Updated 9 months ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆77Updated last month
- This is our own implementation of 'Layer Selective Rank Reduction'☆232Updated 5 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆137Updated last month
- ☆94Updated 2 months ago
- look how they massacred my boy☆58Updated last month
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆54Updated 7 months ago
- PyTorch implementation of models from the Zamba2 series.☆158Updated this week
- Implementation of nougat that focuses on processing pdf locally.☆73Updated 6 months ago
- Start a server from the MLX library.☆161Updated 3 months ago
- ☆99Updated 3 months ago
- Local ML voice chat using high-end models.☆145Updated this week
- Train your own small bitnet model☆56Updated last month
- For inferring and serving local LLMs using the MLX framework☆89Updated 7 months ago
- ☆64Updated 5 months ago