HyperMink / inferenceable
Scalable AI Inference Server for CPU and GPU with Node.js | Utilizes llama.cpp and parts of llamafile C/C++ core under the hood.
☆14Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for inferenceable
- Run Structured LLM Inference with Easy Parallelism☆15Updated 3 months ago
- Secure, locally-run Retrieval-Augmented Generation system for document-based question-answering, utilizing Llama 3, Mistral, and Gemini m…☆20Updated last month
- Ready to go EKS setup☆9Updated 2 months ago
- ☆11Updated 2 months ago
- Define and implement any functions on the fly with LLMs☆11Updated 6 months ago
- Streamable multi-format serialization with schema☆22Updated last month
- 360M model running in the browser on WebGPU☆20Updated 2 months ago
- Gateway and load balancer to your LLM inference endpoints☆18Updated last week
- Data Neuron is a powerful framework that enables you to build text-to-SQL applications with an easily maintainable semantic layer. Whethe…☆41Updated 2 months ago
- 🚀 Full-stack scaffolding tool for Java/Kotlin + React apps.☆18Updated 6 months ago
- Connect to your customer data using any LLM and gain actionable insights. IdentityRAG creates a single comprehensive customer 360 view (g…☆21Updated this week
- 🐝 Create powerful, collaborative AI applications.☆38Updated this week
- An open-source alternative for ridesharing and hitchhiking☆12Updated this week
- A JSX-native peer-to-peer browser that runs on Node.☆11Updated 6 months ago
- ☆9Updated 5 months ago
- Dockerized FastAPI wrapper around the recognize-anything image recognition models☆25Updated 7 months ago
- A Python library for real-time PostgreSQL event-driven cache invalidation.☆18Updated 6 months ago
- Create embeddings for LLM using the Nomic API☆16Updated 7 months ago
- Watch local git repositories, keep in sync with remote and run commands.☆23Updated last week
- Provide a command and let Claude fix the code until the command passes☆21Updated 4 months ago
- Serverless runtime environment tailored for code produced by LLMs. Automatic API generation from your code, support for multiple programm…☆25Updated last year
- Chat strategies for LLMs☆90Updated 2 months ago
- An interface for llama.cpp and ChatGPT☆21Updated last month
- A tiny distributable Node server for serving web pages written in Markdown☆11Updated 11 months ago
- Environment Variables Encryption Tool☆10Updated 2 months ago
- 🔥 Helper program for setting up a Firecracker microVM on a fresh metal☆21Updated last year
- A collection of tools that can be used for LLM function calling☆32Updated 8 months ago
- ☆24Updated last year
- Load generator for TCP servers.☆18Updated 7 months ago
- A simple github actions script to build a llamafile and uploads to huggingface☆10Updated 10 months ago