simonw / llm-mlc
LLM plugin for running models using MLC
☆186Updated last year
Alternatives and similar repositories for llm-mlc
Users that are interested in llm-mlc are comparing it to the libraries listed below
Sorting:
- LLM plugin for running models using llama.cpp☆143Updated last year
- Array-Inspired Pipeline Language☆119Updated last year
- LLM plugin providing access to Mistral models using the Mistral API☆177Updated this week
- hnsqlite integrates hnswlib and sqlite for simple text embedding search☆159Updated last year
- Support for MLX models in LLM☆153Updated 2 weeks ago
- Plugin for LLM adding support for the GPT4All collection of models☆250Updated last year
- Save OpenAI API results to a SQLite database☆232Updated last year
- Count and truncate text based on tokens☆346Updated last year
- For inferring and serving local LLMs using the MLX framework☆103Updated last year
- LLM plugin for interacting with the Claude 3 family of models☆292Updated 3 months ago
- Wraps openai.ChatCompletion to produce pydantic model output via schema prompt and error feedback.☆55Updated last year
- Demos utilizing the ChatGPT API☆95Updated 2 years ago
- Experimental fork of Facebooks LLaMa model which runs it with GPU acceleration on Apple Silicon M1/M2☆86Updated last year
- ☆135Updated last year
- Embedding models from Jina AI☆59Updated last year
- CLI tool for running text through OpenAI Text to speech☆165Updated last year
- Structured LLM APIs☆156Updated last year
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆153Updated last year
- LLM plugin providing access to models running on an Ollama server☆287Updated last week
- ☆113Updated 11 months ago
- automatic sentence highlights based on their significance to the document☆191Updated last year
- Enforce structured output from LLMs 100% of the time☆249Updated 9 months ago
- GPT-3 on your command line☆132Updated last year
- Finetune llama2-70b and codellama on MacBook Air without quantization☆448Updated last year
- LLM plugin for models hosted on Replicate☆62Updated last year
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆221Updated 4 months ago
- GPT-based Conversation Summarizer☆148Updated 2 years ago
- Claudette is Claude's friend☆237Updated this week
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆123Updated 2 months ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆53Updated last year