itsmostafa / inference-speed-tests
Local LLM inference speed tests on various devices
☆65Updated last month
Alternatives and similar repositories for inference-speed-tests:
Users that are interested in inference-speed-tests are comparing it to the libraries listed below
- Local image and music generation for Apple Silicon☆43Updated last month
- A wannabe Ollama equivalent for Apple MlX models☆65Updated last month
- ☆169Updated this week
- Your gateway to both Ollama & Apple MlX models☆122Updated last month
- Optimized Ollama LLM server configuration for Mac Studio and other Apple Silicon Macs. Headless setup with automatic startup, resource op…☆166Updated last month
- Local LLM Powered Recursive Search & Smart Knowledge Explorer☆237Updated 2 months ago
- AI planner similar to OpenAI's deep research☆149Updated this week
- a Repository of Open-WebUI tools to use with your favourite LLMs☆206Updated last month
- Welcome!☆140Updated 4 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 7 months ago
- Orpheus Chat WebUI☆52Updated 3 weeks ago
- Local Apple Notes + LLM Chat☆72Updated last month
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆262Updated 2 weeks ago
- Adaptive Modular Network (AMN) a potentially novel machine learning architecture capable of producing models which can learn at inference…☆52Updated last month
- ☆198Updated this week
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆335Updated 2 weeks ago
- ☆93Updated this week
- Overide (pronounced over·ide) is a lightweight, yet powerful CLI tool that seamlessly integrates AI-powered code generation into your dev…☆174Updated last week
- Tool for scraping and consolidating documentation websites into a single MD file.☆159Updated last week
- FastMLX is a high performance production ready API to host MLX models.☆293Updated last month
- Notate is a desktop chat application that takes AI conversations to the next level. It combines the simplicity of chat with advanced feat…☆250Updated 2 months ago
- Benchmark that evaluates LLMs using 651 NYT Connections puzzles extended with extra trick words☆80Updated last week
- PocketFlow's node-based workflow structure, with Manus' agents and tools!☆192Updated this week
- The Fastest Way to Fine-Tune LLMs Locally☆292Updated last month
- Copy a bunch of files into your clipboard to provide context for LLMs☆106Updated 3 months ago
- Limopola is an AI platform that allows you to communicate with a wide range of AI models. It features autonomous agents, model-agnostic r…☆103Updated this week
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆318Updated this week
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆77Updated 4 months ago
- Finally, an open source Youtube Summarizer extension☆67Updated this week
- ☆149Updated 3 weeks ago