HyperMink / inferenceableLinks
Scalable AI Inference Server for CPU and GPU with Node.js | Utilizes llama.cpp and parts of llamafile C/C++ core under the hood.
☆14Updated last year
Alternatives and similar repositories for inferenceable
Users that are interested in inferenceable are comparing it to the libraries listed below
Sorting:
- Streamable multi-format serialization with schema☆22Updated 6 months ago
- Run Structured LLM Inference with Easy Parallelism☆16Updated 5 months ago
- convert natural language into technical diagrams☆14Updated 6 months ago
- AutoKitteh projects: full-fledged solutions, composable templates, and demos of capabilities and features☆37Updated last week
- LLM plugin for asking questions of LLM's own documentation, and related packages☆18Updated last month
- Gateway and load balancer to your LLM inference endpoints☆23Updated 7 months ago
- A versatile and powerful data platform allowing interactive searches, dashboards, alerts, and more.☆26Updated last week
- ☆12Updated 3 months ago
- A JSX-native peer-to-peer browser that runs on Node, with a custom renderer based on SDL (no DOM).☆11Updated last year
- Create embeddings for LLM using the Nomic API☆23Updated 7 months ago
- A lightweight tool that converts directory contents into structured output optimized for LLM interpretation, featuring Git-aware file ord…☆16Updated last week
- ☆11Updated 10 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆23Updated 3 months ago
- Ready to go EKS setup☆10Updated 9 months ago
- Make tool-calling schemas for existing tools☆14Updated 3 months ago
- Smart reproducible analytical pipeline inspection☆17Updated 2 months ago
- Define and implement any functions on the fly with LLMs☆11Updated last year
- ☆24Updated last year
- Search a JSON path and get the value fast☆22Updated 4 months ago
- 🔥 Helper program for setting up a Firecracker microVM on a fresh metal☆24Updated last year
- ☆10Updated last year
- 360M model running in the browser on WebGPU☆22Updated 10 months ago
- A tshark MCP server for packet capture and analysis☆17Updated 2 weeks ago
- Web form editor framework powered by ProseMirror.☆14Updated this week
- Example usages of the Scaffoldly toolchain.☆16Updated 6 months ago
- Master PDF Summarization with Google Bard☆12Updated last year
- Load generator for TCP servers.☆20Updated last year
- Dillusion is the dillo of the future☆9Updated 11 months ago
- Full-Stack Configuration Management for Developers and Sysadmins☆32Updated 2 weeks ago
- A simple github actions script to build a llamafile and uploads to huggingface☆14Updated last year