Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level
β2,027Apr 12, 2026Updated 2 weeks ago
Alternatives and similar repositories for node-llama-cpp
Users that are interested in node-llama-cpp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Run AI β¨ assistant locally! with simple API for Node.js πβ491Nov 16, 2025Updated 5 months ago
- Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llamβ¦β865Aug 3, 2023Updated 2 years ago
- State-of-the-art Machine Learning for the web. Run π€ Transformers directly in your browser, with no need for a server!β15,914Updated this week
- Ollama JavaScript libraryβ4,181Feb 18, 2026Updated 2 months ago
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inferenceβ1,048Updated this week
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- WebAssembly (Wasm) Build and Bindings for llama.cppβ289Jul 23, 2024Updated last year
- The agent engineering platformβ17,565Updated this week
- LLM inference in C/C++β106,639Updated this week
- The TypeScript library for building AI applications.β1,318Jul 19, 2024Updated last year
- Data framework for your LLM applications. Focus on server side solutionβ3,081Mar 11, 2026Updated last month
- Machine learning framework for Node.js.β283Apr 19, 2025Updated last year
- NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov.β204Apr 20, 2026Updated last week
- High-performance In-browser LLM Inference Engineβ17,825Updated this week
- A fast, efficient Node.js Worker Thread Pool implementationβ5,114Apr 20, 2026Updated last week
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfacesβ142Jul 9, 2024Updated last year
- LLama.cpp rust bindingsβ422Jun 27, 2024Updated last year
- Vectra is a local vector database for Node.js with features similar to pinecone but built using local files.β612Apr 20, 2026Updated last week
- Distribute and run LLMs with a single file.β24,274Apr 23, 2026Updated last week
- Local AI API Platformβ2,761Jul 4, 2025Updated 9 months ago
- Embeddable Postgres with real-time, reactive bindings.β15,146Updated this week
- The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applicationβ¦β23,777Updated this week
- Web framework built on Web Standardsβ30,136Apr 24, 2026Updated last week
- From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.β23,351Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Python bindings for llama.cppβ10,240Updated this week
- Official JavaScript / TypeScript library for the OpenAI APIβ10,853Updated this week
- The official TypeScript SDK for Model Context Protocol servers and clientsβ12,280Updated this week
- LM Studio TypeScript SDKβ1,609Updated this week
- β‘οΈ TypeScript Execute | The easiest way to run TypeScript in Node.jsβ11,953Updated this week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.β170,289Updated this week
- BullMQ - Message Queue and Batch processing for NodeJS, Python, Elixir and PHP based on Redisβ8,781Updated this week
- pgvector support for Node.js, Deno, and Bun (and TypeScript)β435Mar 1, 2026Updated 2 months ago
- The simplest and fastest way to bundle your TypeScript libraries.β11,214Feb 28, 2026Updated 2 months ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A vector search SQLite extension that runs anywhere!β7,483Apr 8, 2026Updated 3 weeks ago
- LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.β45,883Updated this week
- π A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid searβ¦β10,305Feb 13, 2026Updated 2 months ago
- Port of OpenAI's Whisper model in C/C++β49,148Apr 20, 2026Updated last week
- A Next.js chat app to use Llama 2 locally using node-llama-cppβ12Oct 27, 2024Updated last year
- TypeScript-first schema validation with static type inferenceβ42,486Feb 15, 2026Updated 2 months ago
- Shared data types for building collaborative softwareβ21,726Apr 14, 2026Updated 2 weeks ago