Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.
★ 2,109 · Jun 21, 2026 · Updated last week
Alternatives and similar repositories for node-llama-cpp
Users that are interested in node-llama-cpp are comparing it to the libraries listed below.
- Run AI ✨ assistant locally! with simple API for Node.js ★ 495 · Nov 16, 2025 · Updated 7 months ago
- Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llam… ★ 863 · Aug 3, 2023 · Updated 2 years ago
- State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server! ★ 16,152 · Updated this week
- Ollama JavaScript library ★ 4,280 · Feb 18, 2026 · Updated 4 months ago
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference ★ 1,126 · Jun 17, 2026 · Updated last week
- WebAssembly (Wasm) Build and Bindings for llama.cpp ★ 293 · Jul 23, 2024 · Updated last year
- The agent engineering platform ★ 17,862 · Updated this week
- LLM inference in C/C++ ★ 118,422 · Updated this week
- The TypeScript library for building AI applications. ★ 1,320 · Jul 19, 2024 · Updated last year
- Data framework for your LLM applications. Focus on server side solution ★ 3,079 · Mar 11, 2026 · Updated 3 months ago
- Machine learning framework for Node.js. ★ 282 · Apr 19, 2025 · Updated last year
- NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov. ★ 210 · Jun 22, 2026 · Updated last week
- High-performance In-browser LLM Inference Engine ★ 18,279 · Jun 9, 2026 · Updated 3 weeks ago
- ★ 91 · Updated this week
- A fast, efficient Node.js Worker Thread Pool implementation ★ 5,155 · Updated this week
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ★ 143 · Jul 9, 2024 · Updated last year
- LLama.cpp rust bindings ★ 422 · Jun 27, 2024 · Updated 2 years ago
- Vectra is a local vector database for Node.js with features similar to pinecone but built using local files. ★ 623 · Jun 22, 2026 · Updated last week
- Embeddable Postgres with real-time, reactive bindings. ★ 15,450 · Updated this week
- Local AI API Platform ★ 2,755 · Jul 4, 2025 · Updated 11 months ago
- Distribute and run LLMs with a single file. ★ 25,105 · Updated this week
- The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered application… ★ 25,191 · Updated this week
- Web framework built on Web Standards ★ 31,138 · Jun 23, 2026 · Updated last week
- Mastra is the modern TypeScript framework for AI-powered applications and agents. ★ 25,558 · Updated this week
- Python bindings for llama.cpp ★ 10,446 · Updated this week
- Official JavaScript / TypeScript library for the OpenAI API ★ 10,989 · Jun 24, 2026 · Updated last week
- The official TypeScript SDK for Model Context Protocol servers and clients ★ 12,736 · Updated this week
- LM Studio TypeScript SDK ★ 1,713 · Jun 17, 2026 · Updated last week
- ⚡️ TypeScript Execute | The easiest way to run TypeScript in Node.js ★ 12,029 · May 31, 2026 · Updated last month
- Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. ★ 174,889 · Updated this week
- BullMQ - Message Queue and Batch processing for NodeJS, Python, Elixir and PHP based on Redis ★ 9,035 · Jun 24, 2026 · Updated last week
- pgvector support for Node.js, Deno, and Bun (and TypeScript) ★ 442 · Updated this week
- The simplest and fastest way to bundle your TypeScript libraries. ★ 11,273 · Jun 14, 2026 · Updated 2 weeks ago
- LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required. ★ 47,070 · Jun 23, 2026 · Updated last week
- A vector search SQLite extension that runs anywhere! ★ 7,790 · May 18, 2026 · Updated last month
- Port of OpenAI's Whisper model in C/C++ ★ 51,030 · Jun 23, 2026 · Updated last week
- A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid sear… ★ 10,455 · Updated this week
- Shared data types for building collaborative software ★ 22,090 · Jun 22, 2026 · Updated last week
- TypeScript-first schema validation with static type inference ★ 43,056 · Jun 13, 2026 · Updated 2 weeks ago