lxe / wasm-gptLinks
Tensor library for machine learning
☆276Updated 2 years ago
Alternatives and similar repositories for wasm-gpt
Users that are interested in wasm-gpt are comparing it to the libraries listed below
Sorting:
- WebGPU LLM inference tuned by hand☆151Updated 2 years ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated 2 years ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆137Updated 11 months ago
- OpenAI-compatible Python client that can call any LLM☆371Updated 2 years ago
- Layered, depth-first reading—start with summaries, tap to explore details, and gain clarity on complex topics.☆273Updated last year
- ☆145Updated last year
- Run GGML models with Kubernetes.☆173Updated last year
- Augment GPT-4 Environment Access☆285Updated 2 years ago
- An implementation of bucketMul LLM inference☆217Updated 11 months ago
- JS tokenizer for LLaMA 1 and 2☆353Updated 11 months ago
- Extensible AI assistant platform that bridges LLMs to tasks and actions☆38Updated 2 years ago
- 🦜️🔗 This is a very simple re-implementation of LangChain, in ~100 lines of code☆253Updated last year
- https://ermine.ai -- 100% client-side live audio transcription, powered by transformers.js☆324Updated 2 years ago
- Simple repo that compiles and runs llama2.c on the Web☆57Updated last year
- LLM-based code completion engine☆194Updated 5 months ago
- LLaMA Cog template☆307Updated last year
- ☆163Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat☆101Updated last year
- C++ implementation for 💫StarCoder☆453Updated last year
- Tensor computation with WebGPU acceleration☆620Updated 10 months ago
- Vercel and web-llm template to run wasm models directly in the browser.☆152Updated last year
- Enforce structured output from LLMs 100% of the time☆249Updated 11 months ago
- Redteaming LLMs using other LLMs☆254Updated 2 years ago
- A tiny implementation of an autonomous agent powered by LLMs (OpenAI GPT-4)☆443Updated 2 years ago
- Revealing example of self-attention, the building block of transformer AI models☆131Updated 2 years ago
- LLM-based tool for parsing information and chatting with it☆214Updated last year
- Applied methods of analytical augmentation to build tools using large-language models.☆26Updated 2 years ago
- Next-token prediction in JavaScript — build fast language and diffusion models.☆143Updated 9 months ago
- Run inference on replit-3B code instruct model using CPU☆156Updated last year
- Web-optimized vector database (written in Rust).☆243Updated 3 months ago