Vercel and web-llm template to run wasm models directly in the browser.
☆169Nov 21, 2023Updated 2 years ago
Alternatives and similar repositories for wasm-ai
Users that are interested in wasm-ai are comparing it to the libraries listed below
Sorting:
- This is a FastAPI based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing.☆16Updated this week
- Test your local LLMs on the AIME problems☆32Jun 7, 2025Updated 9 months ago
- ☆36Feb 6, 2026Updated last month
- ☆10Apr 10, 2014Updated 11 years ago
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference☆1,009Dec 17, 2025Updated 2 months ago
- A platform aimed at creating websites that perform self-optimization☆12May 4, 2024Updated last year
- Native Salesforce app to codify compliance rules and check documents against them. DMD is the PMD for Business Documents.☆13Jan 20, 2026Updated last month
- AI assisted article writing project☆13Jan 31, 2025Updated last year
- A Go gRPC client library for Vald☆13Feb 3, 2026Updated last month
- GHC WASM backend made easy to use for platforms without precompiled bindists powered by Earthly☆14Apr 19, 2025Updated 10 months ago
- Use GPTparser with your OpenAI API to scrape & parse files into structured JSON files.☆13Apr 2, 2024Updated last year
- My Gen AI research☆11Jun 3, 2024Updated last year
- Read slack messages from selected channels since last night and summarize them☆10Nov 19, 2023Updated 2 years ago
- ☆27Jul 9, 2024Updated last year
- Rust Implementation of micrograd☆52Jul 3, 2024Updated last year
- ☆35Aug 16, 2024Updated last year
- A browser-based AI inference network.☆126Jul 28, 2024Updated last year
- Code for paper https://arxiv.org/abs/2501.00522☆14Apr 28, 2025Updated 10 months ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- We present algorithms to extract data from annual reports (PDF files) using the Python programming language with the aim to automate the …☆16Jul 1, 2021Updated 4 years ago
- High-performance In-browser LLM Inference Engine☆17,515Mar 2, 2026Updated last week
- ☆15Apr 10, 2024Updated last year
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism☆14Aug 10, 2023Updated 2 years ago
- A workflowy clone written in Svelte, that aims to be pixel perfect!☆14Feb 8, 2026Updated last month
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Jan 29, 2024Updated 2 years ago
- ☆14Apr 16, 2025Updated 10 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Dec 4, 2023Updated 2 years ago
- Python client for txtai☆15Feb 25, 2026Updated last week
- Developing a Toolkit for On-chain analysis of Blockchains☆17Oct 19, 2019Updated 6 years ago
- Automatically annotates YOLO dataset using Moondream visual model☆20Aug 24, 2025Updated 6 months ago
- Small app to gather UserStories for Projects in your company☆17Mar 25, 2025Updated 11 months ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Aug 17, 2023Updated 2 years ago
- 📷 A module for the interacting with the Imgflip API.☆15Sep 1, 2025Updated 6 months ago
- Machine learning and data analysis package implemented in JavaScript and its online demo.☆17Mar 1, 2026Updated last week
- WebAssembly (Wasm) Build and Bindings for llama.cpp☆286Jul 23, 2024Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Jan 4, 2024Updated 2 years ago
- 2024 LlamaIndex RAG Hackathon "1st Place Award" Project☆69Feb 16, 2024Updated 2 years ago
- Time series forecasting with DuckDB and Evidence☆43Nov 1, 2024Updated last year
- Setu is a comprehensive pipeline designed to clean, filter, and deduplicate diverse data sources including Web, PDF, and Speech data. Bui…☆16May 17, 2024Updated last year