belladoreai / llama-tokenizer-js
JS tokenizer for LLaMA 1 and 2
☆351 Updated 9 months ago
Alternatives and similar repositories for llama-tokenizer-js:
Users who are interested in llama-tokenizer-js are comparing it to the libraries listed below:
- OpenAI-compatible Python client that can call any LLM ☆370 Updated last year
- Finetune llama2-70b and codellama on MacBook Air without quantization ☆448 Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆135 Updated 9 months ago
- Enforce structured output from LLMs 100% of the time ☆249 Updated 9 months ago
- LLaMa retrieval plugin script using OpenAI's retrieval plugin ☆324 Updated 2 years ago
- Tensor library for machine learning ☆278 Updated 2 years ago
- 🦜️🔗 This is a very simple re-implementation of LangChain, in ~100 lines of code ☆253 Updated last year
- Complex LLM Workflows from Simple JSON. ☆296 Updated last year
- Command-line script for inferencing from models such as falcon-7b-instruct ☆76 Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆223 Updated 11 months ago
- Extend the original llama.cpp repo to support redpajama model. ☆117 Updated 7 months ago
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as… ☆351 Updated last year
- Exact structure out of any language model completion. ☆508 Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆123 Updated last year
- Action library for AI Agent ☆214 Updated 3 weeks ago
- Constrained Decoding for LLMs against JSON Schema ☆326 Updated last year
- Inference code for Persimmon-8B ☆415 Updated last year
- Tune any FALCON in 4-bit ☆466 Updated last year
- C++ implementation for 💫StarCoder ☆453 Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes… ☆147 Updated last year
- Create API agents from OpenAPI Specs ☆180 Updated last year
- Merge Transformers language models by use of gradient parameters. ☆206 Updated 8 months ago
- An implementation of bucketMul LLM inference ☆216 Updated 9 months ago
- A benchmark for emotional intelligence in large language models ☆281 Updated 8 months ago
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo… ☆174 Updated last year
- Simple repo that compiles and runs llama2.c on the Web ☆54 Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s ☆710 Updated last year