JS tokenizer for LLaMA 1 and 2
☆364Jun 27, 2024Updated 2 years ago
Alternatives and similar repositories for llama-tokenizer-js
Users that are interested in llama-tokenizer-js are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tensor library for machine learning☆273Apr 23, 2023Updated 3 years ago
- A toolbox for working with WebRTC, Audio and AI☆702Jul 29, 2023Updated 2 years ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,567Mar 4, 2026Updated 3 months ago
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- Simple UI for LLM Model Finetuning☆2,052Dec 21, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆17Dec 18, 2023Updated 2 years ago
- A simple "Be My Eyes" web app with a llama.cpp/llava backend☆496Nov 28, 2023Updated 2 years ago
- Plugin for LLM-CLI adding support for Together.AI hosting a large collection of open-source LLMs☆18Apr 10, 2024Updated 2 years ago
- Falcon LLM ggml framework with CPU and GPU support☆249Jan 22, 2024Updated 2 years ago
- ☆16Dec 16, 2024Updated last year
- Finetune llama2-70b and codellama on MacBook Air without quantization☆449Mar 28, 2024Updated 2 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,924Sep 30, 2023Updated 2 years ago
- AI-managed code blocks in Python ⏪⏩☆465Oct 5, 2023Updated 2 years ago
- High-performance In-browser LLM Inference Engine☆18,279Jun 9, 2026Updated 3 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆10,248Sep 7, 2024Updated last year
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset☆7,530Jul 16, 2023Updated 2 years ago
- Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.☆3,719Mar 12, 2024Updated 2 years ago
- Slidef is a CLI tool that converts your PDF presentations into a modern, web-based slide viewer. Perfect for sharing presentations, creat…☆33Nov 8, 2025Updated 7 months ago
- Fill up the `model_list` field in your LiteLLM proxy configuration file☆10Sep 7, 2024Updated last year
- Generate High Quality textual or multi-modal datasets with Agents☆18Jun 7, 2023Updated 3 years ago
- Llama 2 Everywhere (L2E)☆1,526Aug 27, 2025Updated 10 months ago
- Quantized inference code for LLaMA models☆1,038Mar 17, 2023Updated 3 years ago
- ☆260Jul 15, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Fork of Facebooks LLaMa model to run on CPU☆766Mar 6, 2023Updated 3 years ago
- An extensible, easy-to-use, and portable diffusion web UI 👨🎨☆1,669Aug 18, 2023Updated 2 years ago
- AutoChain: Build lightweight, extensible, and testable LLM Agents☆1,877Dec 16, 2025Updated 6 months ago
- Visual Studio Code extension for WizardCoder☆148Aug 1, 2023Updated 2 years ago
- code for training and using chess embeddings models☆14Jun 9, 2024Updated 2 years ago
- Complex LLM Workflows from Simple JSON.☆322Aug 11, 2023Updated 2 years ago
- AI superpowers you own. Sila is an open alternative to ChatGPT where you own AI assistants, chats and data.☆27Apr 22, 2026Updated 2 months ago
- Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Tra…☆1,285Jan 24, 2024Updated 2 years ago
- Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but…☆2,080Jun 18, 2026Updated last week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,930Feb 24, 2024Updated 2 years ago
- A guidance language for controlling large language models.☆21,519May 21, 2026Updated last month
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,485Jun 7, 2025Updated last year
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism☆14Aug 10, 2023Updated 2 years ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- Minimalist log collector☆114Jan 14, 2025Updated last year
- Finetune a LLM to speak like you based on your WhatsApp Conversations☆380May 5, 2024Updated 2 years ago