belladoreai / llama-tokenizer-js
JS tokenizer for LLaMA 1 and 2
☆342 Updated 4 months ago
Related projects
Alternatives and complementary repositories for llama-tokenizer-js
- LLaMA Cog template ☆306 Updated 9 months ago
- LLaMa retrieval plugin script using OpenAI's retrieval plugin ☆324 Updated last year
- Finetune llama2-70b and codellama on MacBook Air without quantization ☆447 Updated 7 months ago
- OpenAI-compatible Python client that can call any LLM ☆366 Updated last year
- Complex LLM Workflows from Simple JSON. ☆280 Updated last year
- Enforce structured output from LLMs 100% of the time ☆241 Updated 4 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆124 Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆131 Updated 4 months ago
- ☆149 Updated 4 months ago
- Tensor library for machine learning ☆279 Updated last year
- 🦜️🔗 This is a very simple re-implementation of LangChain, in ~100 lines of code ☆250 Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆221 Updated 6 months ago
- Constrained Decoding for LLMs against JSON Schema ☆322 Updated last year
- Action library for AI Agent ☆191 Updated 2 weeks ago
- Extend the original llama.cpp repo to support redpajama model. ☆117 Updated 2 months ago
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo… ☆169 Updated 7 months ago
- Generate question/answer training pairs out of raw text. ☆204 Updated 11 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes… ☆145 Updated last year
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆102 Updated last year
- Visualize the intermediate output of Mistral 7B ☆313 Updated 9 months ago
- WebGPU LLM inference tuned by hand ☆147 Updated last year
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry ☆266 Updated last week
- Stateful load balancer custom-tailored for llama.cpp ☆563 Updated this week
- JS tokenizer for LLaMA 3 and LLaMA 3.1 ☆91 Updated 3 months ago
- ☆411 Updated last year
- ☆113 Updated 3 weeks ago
- WebAssembly (Wasm) Build and Bindings for llama.cpp ☆213 Updated 3 months ago
- A simple Python sandbox for helpful LLM data agents ☆170 Updated 5 months ago