lxe / wasm-gpt
Tensor library for machine learning
☆278 Updated last year
Alternatives and similar repositories for wasm-gpt:
Users who are interested in wasm-gpt are comparing it to the libraries listed below.
- WebGPU LLM inference tuned by hand ☆148 Updated last year
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆102 Updated last year
- Revealing example of self-attention, the building block of transformer AI models ☆130 Updated last year
- Extend the original llama.cpp repo to support the RedPajama model. ☆117 Updated 4 months ago
- Command-line script for inferencing from models such as MPT-7B-Chat ☆101 Updated last year
- Enforce structured output from LLMs 100% of the time ☆245 Updated 6 months ago
- JS tokenizer for LLaMA 1 and 2 ☆348 Updated 7 months ago
- Tiny inference-only implementation of LLaMA ☆91 Updated 9 months ago
- Implement recursion using English as the programming language and an LLM as the runtime. ☆135 Updated last year
- OpenAI-compatible Python client that can call any LLM ☆369 Updated last year
- ☆139 Updated last year
- Extensible AI assistant platform that bridges LLMs to tasks and actions ☆38 Updated last year
- Run inference on replit-3B code instruct model using CPU ☆154 Updated last year
- Run GGML models with Kubernetes. ☆173 Updated last year
- Web-optimized vector database (written in Rust). ☆200 Updated 2 weeks ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆131 Updated 6 months ago
- ☆250 Updated last year
- An implementation of bucketMul LLM inference ☆215 Updated 6 months ago
- Augment GPT-4 Environment Access ☆287 Updated last year
- A library for incremental loading of large PyTorch checkpoints ☆56 Updated last year
- Redteaming LLMs using other LLMs ☆253 Updated last year
- 🦜️🔗 This is a very simple re-implementation of LangChain, in ~100 lines of code ☆251 Updated last year
- GGUF implementation in C as a library and a tools CLI program ☆251 Updated 3 weeks ago
- https://ermine.ai -- 100% client-side live audio transcription, powered by transformers.js ☆323 Updated last year
- Mistral7B playing DOOM ☆127 Updated 6 months ago
- Large language model evaluation and workflow framework from Phase AI. ☆453 Updated last week
- LLaMa retrieval plugin script using OpenAI's retrieval plugin ☆324 Updated last year
- LLM-based tool for parsing information and chatting with it ☆215 Updated last year
- Easy-to-use headless React Hooks to run LLMs in the browser with WebGPU. Just useLLM(). ☆687 Updated last year
- SoTA Transformers with C-backend for fast inference on your CPU. ☆312 Updated last year