Run Llama 2 using MLX on macOS
☆34 · Dec 18, 2023 · Updated 2 years ago
Alternatives and similar repositories for llm-mlx-llama
Users that are interested in llm-mlx-llama are comparing it to the libraries listed below.
- "llm python" is a command to run a Python interpreter in the LLM virtual environment ☆37 · Oct 27, 2023 · Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆17 · Mar 11, 2026 · Updated last month
- The Solana Memo program and its clients ☆20 · Apr 23, 2026 · Updated last week
- MLX implementation of GCN, with benchmark on MPS, CUDA and CPU (M1 Pro, M2 Ultra, M3 Max). ☆25 · Dec 16, 2023 · Updated 2 years ago
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges ☆53 · Jul 3, 2025 · Updated 10 months ago
- ANE accelerated embedding models! ☆20 · Dec 11, 2024 · Updated last year
- API server for aolium.com ☆18 · May 18, 2024 · Updated last year
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon ☆16 · May 8, 2025 · Updated 11 months ago
- Plugin for LLM adding a Markov chain generating model ☆21 · Jul 3, 2024 · Updated last year
- HTTP Proxy for DuckDB ☆23 · Jun 4, 2024 · Updated last year
- Scripts to create your own moe models using mlx ☆89 · Feb 26, 2024 · Updated 2 years ago
- A very basic webserver and REST key-value store ☆22 · Sep 23, 2013 · Updated 12 years ago
- dwarfdump utility but in Zig ☆30 · Mar 8, 2024 · Updated 2 years ago
- A Model Context Protocol server for Jira. ☆27 · Jul 25, 2025 · Updated 9 months ago
- alternative docs for ActivityStreams 2.0 vocabulary ☆17 · Mar 23, 2024 · Updated 2 years ago
- Source code for the paper titled: "Unlocking the full potential of smart charging: Addressing paused and delayed charging problems in ele… ☆11 · May 22, 2024 · Updated last year
- This repository shows how to implement reusable Codeblock components inside of a SvelteKit project, with syntax highlighting by Shiki. Th… ☆13 · Jan 26, 2024 · Updated 2 years ago
- Embedding models from Jina AI ☆66 · Jan 18, 2024 · Updated 2 years ago
- A simple example of VAEs with KANs ☆12 · May 17, 2024 · Updated last year
- A browser library for turning GeoJson into a GPX XMLDocument ☆12 · Mar 23, 2024 · Updated 2 years ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ☆18 · Oct 13, 2025 · Updated 6 months ago
- Integrates DuckDB with the high-performance Apache DataSketches library. This extension enables users to perform approximate analytics on… ☆44 · Apr 27, 2026 · Updated last week
- ☆15 · May 17, 2024 · Updated last year
- Repo that hosts the companion book of Julia for Deep Learning ☆14 · Aug 23, 2023 · Updated 2 years ago
- A tiny server to run local inference on MLX model in the style of OpenAI ☆13 · Jan 31, 2024 · Updated 2 years ago
- ☆11 · Aug 22, 2023 · Updated 2 years ago
- Deep-Learning-Based Flow Prediction for CO2 Storage in Shale–Sandstone Formations ☆11 · Jan 27, 2023 · Updated 3 years ago
- A cookiecutter template for building plugins for LLM ☆31 · Apr 10, 2026 · Updated 3 weeks ago
- An introduction to global assessment techniques using Python ☆12 · Apr 24, 2023 · Updated 3 years ago
- Information about the CodedotAI reading group sessions. ☆12 · Aug 16, 2021 · Updated 4 years ago
- ☆24 · Mar 30, 2026 · Updated last month
- Towards Finding the Essence of Everything in Large Language Models ☆14 · Mar 29, 2026 · Updated last month
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features" ☆17 · Mar 31, 2025 · Updated last year
- Contains the summaries and notes on a variety of DL papers/blogs ☆12 · Jul 30, 2024 · Updated last year
- Deploy your iOS apps written with Zig! ☆33 · Apr 19, 2022 · Updated 4 years ago
- ☆47 · Updated this week
- A bookshelf plugin which handles relationships. ☆22 · Updated this week
- Coder Desktop application for Windows ☆23 · Feb 24, 2026 · Updated 2 months ago
- VSCode Chat Extensions made easy ☆17 · Aug 5, 2024 · Updated last year