Vercel and web-llm template to run wasm models directly in the browser.
☆174 · Apr 17, 2026 · Updated last month
Alternatives and similar repositories for wasm-ai
Users that are interested in wasm-ai are comparing it to the libraries listed below.
- A browser-based AI inference network. ☆128 · Jul 28, 2024 · Updated last year
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference ☆1,072 · May 17, 2026 · Updated last week
- Train text generation model with JavaScript. ☆15 · Jul 14, 2024 · Updated last year
- Test your local LLMs on the AIME problems ☆39 · Jun 7, 2025 · Updated 11 months ago
- Graphlit Platform ☆32 · Feb 20, 2024 · Updated 2 years ago
- Run `npm i -g socrate` to install a discussion room for using GPT personalities with internal monologues to debate problems. Provide a pr… ☆28 · Apr 12, 2023 · Updated 3 years ago
- This is a demo repository for parallel multi-index question answering using streamlit and llama index ☆24 · Aug 31, 2023 · Updated 2 years ago
- High-performance In-browser LLM Inference Engine ☆18,047 · May 19, 2026 · Updated last week
- Open Source Projects from Pallas Lab ☆21 · Oct 10, 2021 · Updated 4 years ago
- ☆12 · Dec 23, 2024 · Updated last year
- Compile WASM binaries to Javascript code. ☆32 · Feb 6, 2024 · Updated 2 years ago
- My Gen AI research ☆11 · Jun 3, 2024 · Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆65 · Dec 4, 2023 · Updated 2 years ago
- This repository contains common interfaces, services and utilities used by other Paperbits libraries ☆20 · Oct 8, 2025 · Updated 7 months ago
- Python client for txtai ☆15 · May 12, 2026 · Updated 2 weeks ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs ☆18 · Aug 17, 2023 · Updated 2 years ago
- Run LLMs in the Browser with MLC / WebLLM ✨ ☆169 · Oct 5, 2024 · Updated last year
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism ☆14 · Aug 10, 2023 · Updated 2 years ago
- A python script that automates the process of giving input to GPT and receiving output ☆10 · Apr 2, 2023 · Updated 3 years ago
- ☆41 · May 17, 2026 · Updated last week
- Read slack messages from selected channels since last night and summarize them ☆10 · Nov 19, 2023 · Updated 2 years ago
- A nuxt codemirror module to enjoy all the runtime editor possibilities ☆24 · Updated this week
- A high-performance attention mechanism that computes softmax normalization in a single streaming pass using running accumulators (online … ☆31 · Updated this week
- Vercel AI provider for Chrome built-in model (Gemini Nano) ☆374 · Sep 9, 2024 · Updated last year
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust ☆79 · Jan 14, 2024 · Updated 2 years ago
- ☆16 · Apr 25, 2023 · Updated 3 years ago
- Legal Matter Standard Specification (LMSS) library for Python ☆17 · Nov 14, 2023 · Updated 2 years ago
- WebAssembly (Wasm) Build and Bindings for llama.cpp ☆291 · Jul 23, 2024 · Updated last year
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- code for the paper "Cluster & Tune: Boost Cold Start Performance in Text Classification" for ACL2022 ☆27 · May 18, 2022 · Updated 4 years ago
- ☆259 · May 14, 2026 · Updated last week
- ☆15 · Apr 10, 2024 · Updated 2 years ago
- A starter for Langchain, Docker Compose, Fastapi, Qdrant, Sveltekit ☆24 · May 4, 2023 · Updated 3 years ago
- Constraint-based graph layout in tldraw ☆163 · Apr 15, 2024 · Updated 2 years ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization" ☆11 · Mar 31, 2024 · Updated 2 years ago
- Fluid Database ☆111 · Sep 20, 2024 · Updated last year
- neovim remote rpc client implementation with zig ☆28 · Feb 6, 2026 · Updated 3 months ago
- A fake livestreaming twitch.tv chat, where small streamers can communicate with unique ChatGPT bots that act as fans. ☆17 · Jan 19, 2025 · Updated last year
- ☆27 · Jul 9, 2024 · Updated last year