HyperMink / inferenceable
Scalable AI Inference Server for CPU and GPU with Node.js | Utilizes llama.cpp and parts of llamafile C/C++ core under the hood.
☆14 · Updated 9 months ago
Alternatives and similar repositories for inferenceable:
Users who are interested in inferenceable are comparing it to the libraries listed below:
- Run Structured LLM Inference with Easy Parallelism ☆15 · Updated last month
- convert natural language into technical diagrams ☆12 · Updated 2 months ago
- Streamable multi-format serialization with schema ☆22 · Updated 2 months ago
- Search a JSON path and get the value fast ☆21 · Updated 2 weeks ago
- Example usages of the Scaffoldly toolchain. ☆14 · Updated 2 months ago
- Gateway and load balancer to your LLM inference endpoints ☆21 · Updated 4 months ago
- Ready to go EKS setup ☆10 · Updated 6 months ago
- 360M model running in the browser on WebGPU ☆21 · Updated 6 months ago
- A collection of tools that can be used for LLM function calling ☆32 · Updated 11 months ago
- Create embeddings for LLM using the Nomic API ☆22 · Updated 3 months ago
- ☆10 · Updated 9 months ago
- Optimum graph creation and distribution for underground networks. ☆33 · Updated 8 months ago
- A fast TUI application (with optional webui) to visually navigate and inspect JSON and JSONL data. Easily localize parse errors in large … ☆13 · Updated 5 months ago
- Safely Run Commands in Productions ☆17 · Updated 3 months ago
- Secure, locally-run Retrieval-Augmented Generation system for document-based question-answering, utilizing Llama 3, Mistral, and Gemini m… ☆23 · Updated 4 months ago
- A tiny distributable Node server for serving web pages written in Markdown ☆11 · Updated last year
- Pragmatic framework to build LLM Copilots ☆17 · Updated last month
- Master PDF Summarization with Google Bard ☆12 · Updated last year
- Load generator for TCP servers. ☆20 · Updated 11 months ago
- Pollux payload core files and examples ☆3 · Updated 3 weeks ago
- A JSX-native peer-to-peer browser that runs on Node. ☆11 · Updated 10 months ago
- Query databases and tables with AI assistance ☆16 · Updated 10 months ago
- A QT GUI for large language models ☆31 · Updated last year
- Serverless runtime environment tailored for code produced by LLMs. Automatic API generation from your code, support for multiple programm… ☆25 · Updated last year
- An open-source alternative for ridesharing and hitchhiking ☆13 · Updated 2 weeks ago
- Vector Embedding Server in under 100 lines of code ☆22 · Updated last year
- ☆12 · Updated 6 months ago
- Repository to allow collaboration between Cycle Labs Cloud community in support of the community. ☆9 · Updated 3 years ago