NolanoOrg / InstructLLaMa.cpp
Fast inference of Instruct tuned LLaMa on your personal devices.
☆23Updated 2 years ago
Alternatives and similar repositories for InstructLLaMa.cpp
Users who are interested in InstructLLaMa.cpp are comparing it to the libraries listed below.
- Command-line script for inferencing from models such as MPT-7B-Chat☆100Updated 2 years ago
- Instruct-tuning LLaMA on consumer hardware☆66Updated 2 years ago
- 4 bits quantization of SantaCoder using GPTQ☆51Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Updated 2 years ago
- Smol but mighty language model☆65Updated 2 years ago
- Command-line script for inferencing from models such as falcon-7b-instruct☆75Updated 2 years ago
- Drop in replacement for OpenAI, but with Open models.☆154Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated 2 years ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated 2 years ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Updated last month
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆42Updated 2 years ago
- The first AI artist☆32Updated 2 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Updated last year
- ☆74Updated 2 years ago
- Full finetuning of large language models without large memory requirements☆94Updated 4 months ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆31Updated 2 years ago
- ☆40Updated 2 years ago
- A synthetic story narration dataset to study small audio LMs.☆31Updated 2 years ago
- Simple LLM inference server☆20Updated last year
- inference code for mixtral-8x7b-32kseqlen☆105Updated 2 years ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆71Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated 2 years ago
- OpenPipe Reinforcement Learning Experiments☆32Updated 10 months ago
- Embeddings focused small version of Llama NLP model☆107Updated 2 years ago
- Web browser version of StarCoder.cpp☆46Updated 2 years ago
- Generates grammar files from typescript for LLM generation☆38Updated last year
- Simplex Random Feature attention, in PyTorch☆75Updated 2 years ago
- Approximating the joint distribution of language models via MCTS☆22Updated last year
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated 2 years ago
- Simple setup to self-host LLaMA3-70B model with an OpenAI API☆19Updated last year