HyperMink / inferenceable
Scalable AI Inference Server for CPU and GPU with Node.js | Utilizes llama.cpp and parts of llamafile C/C++ core under the hood.
☆14Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for inferenceable
- Run Structured LLM Inference with Easy Parallelism☆15Updated this week
- Streamable multi-format serialization with schema☆23Updated 2 months ago
- Secure, locally-run Retrieval-Augmented Generation system for document-based question-answering, utilizing Llama 3, Mistral, and Gemini m…☆21Updated last month
- Ready to go EKS setup☆9Updated 2 months ago
- Define and implement any functions on the fly with LLMs☆11Updated 6 months ago
- A JSX-native peer-to-peer browser that runs on Node.☆11Updated 7 months ago
- ☆10Updated 6 months ago
- ☆12Updated 3 months ago
- 360M model running in the browser on WebGPU☆20Updated 3 months ago
- Environment Variables Encryption Tool☆10Updated 2 months ago
- An open-source alternative for ridesharing and hitchhiking☆12Updated 2 weeks ago
- Configuration Management That Evolves With Your Infrastructure☆32Updated this week
- Query databases and tables with AI assistance☆16Updated 7 months ago
- ☆24Updated last year
- Dillusion is the dillo of the future☆9Updated 4 months ago
- An open source collection of agentic Github workflows☆13Updated 6 months ago
- ☆17Updated last week
- An interface for llama.cpp and ChatGPT☆21Updated this week
- A tiny distributable Node server for serving web pages written in Markdown☆11Updated last year
- OMF is a compact, user-friendly specification that defines a lightweight API contract between client and server for building conversation…☆58Updated 2 months ago
- Data Neuron is a powerful framework that enables you to build text-to-SQL applications with an easily maintainable semantic layer. Whethe…☆41Updated 3 months ago
- Analyze your image in seconds with AI☆60Updated 5 months ago
- Chat strategies for LLMs☆91Updated 3 months ago
- pip installable duckdb extensions published to pypi☆12Updated 2 weeks ago
- Self-hostable headless QR code generator☆15Updated 2 months ago
- 🔥 Helper program for setting up a Firecracker microVM on a fresh metal☆21Updated last year
- Express.js ported to a Service Worker context☆17Updated 10 months ago
- Pollux payload core files and examples☆3Updated 10 months ago
- Dockerized FastAPI wrapper around the recognize-anything image recognition models☆25Updated 8 months ago
- Use GPTparser with your OpenAI API to scrape & parse files into structured JSON files.☆12Updated 7 months ago