HyperMink / inferenceable
Scalable AI Inference Server for CPU and GPU with Node.js | Utilizes llama.cpp and parts of llamafile C/C++ core under the hood.
☆14Updated 11 months ago
Alternatives and similar repositories for inferenceable
Users that are interested in inferenceable are comparing it to the libraries listed below
Sorting:
- Run Structured LLM Inference with Easy Parallelism☆16Updated 3 months ago
- convert natural language into technical diagrams☆14Updated 5 months ago
- Streamable multi-format serialization with schema☆22Updated 5 months ago
- Search a JSON path and get the value fast☆22Updated 3 months ago
- ☆10Updated 11 months ago
- 360M model running in the browser on WebGPU☆21Updated 8 months ago
- 🔥 Helper program for setting up a Firecracker microVM on a fresh metal☆23Updated last year
- Smart reproducible analytical pipeline inspection☆17Updated 3 weeks ago
- Create embeddings for LLM using the Nomic API☆23Updated 5 months ago
- LLM plugin for asking questions of LLM's own documentation, and related packages☆16Updated last week
- A JSX-native peer-to-peer browser that runs on Node.☆11Updated last year
- A tiny distributable Node server for serving web pages written in Markdown☆11Updated last year
- A Python library for real-time PostgreSQL event-driven cache invalidation.☆22Updated 3 weeks ago
- A collection of tools that can be used for LLM function calling☆33Updated last year
- Example usages of the Scaffoldly toolchain.☆16Updated 4 months ago
- Automatically pass your funcions defined in Python to ChatGPT have it call them back seemlessly.☆13Updated last year
- Make tool-calling schemas for existing tools☆14Updated 2 months ago
- ☆12Updated 9 months ago
- Dillusion is the dillo of the future☆9Updated 10 months ago
- Secure, locally-run Retrieval-Augmented Generation system for document-based question-answering, utilizing Llama 3, Mistral, and Gemini m…☆23Updated 7 months ago
- Master PDF Summarization with Google Bard☆12Updated last year
- ☆12Updated 2 months ago
- Ready to go EKS setup☆10Updated 8 months ago
- Optimum graph creation and distribution for underground networks.☆34Updated 10 months ago
- Gateway and load balancer to your LLM inference endpoints☆22Updated 6 months ago
- A GUI-based AI development tool with integrated Metaphor support☆41Updated this week
- Pollux payload core files and examples☆3Updated this week
- HandyDash is a cross-platform HTTP, TCP, and IP monitoring tool, intended for desktop use. It is agent free, requires no installation, an…☆16Updated 9 months ago
- An Agentic platform that allows you to define extensions☆27Updated last month
- recipes for BASH, Docker and more☆13Updated 3 months ago