LeonEricsson / llmcontext
Pressure testing the context window of open LLMs
☆22 · Updated 5 months ago
Alternatives and similar repositories for llmcontext:
Users interested in llmcontext are comparing it to the repositories listed below.
- An implementation of Self-Extend, to expand the context window via grouped attention · ☆118 · Updated last year
- [WIP] Transformer to embed Danbooru labelsets · ☆13 · Updated 10 months ago
- An unsupervised model merging algorithm for Transformers-based language models · ☆104 · Updated 9 months ago
- ☆65 · Updated 8 months ago
- ☆74 · Updated last year
- ☆49 · Updated 11 months ago
- Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning · ☆28 · Updated last year
- entropix style sampling + GUI · ☆25 · Updated 3 months ago
- QuIP quantization · ☆48 · Updated 10 months ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model · ☆40 · Updated last year
- Full finetuning of large language models without large memory requirements · ☆93 · Updated last year
- Easy-to-use, high-performance knowledge distillation for LLMs · ☆45 · Updated last month
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models · ☆21 · Updated 2 months ago
- Using modal.com to process FineWeb-edu data · ☆20 · Updated 2 months ago
- GPT-2 small trained on phi-like data · ☆65 · Updated 11 months ago
- ☆111 · Updated last month
- RWKV-7: Surpassing GPT · ☆76 · Updated 2 months ago
- Demonstration that finetuning a RoPE model on sequences longer than its pre-training length adapts the model's context limit · ☆63 · Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens · ☆129 · Updated last week
- ☆45 · Updated last week
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors · ☆43 · Updated 10 months ago
- ☆27 · Updated last year
- Simple LLM inference server · ☆20 · Updated 8 months ago
- Low-Rank adapter extraction for fine-tuned transformers models · ☆169 · Updated 9 months ago
- ☆123 · Updated 5 months ago
- GPU-accelerated client-side embeddings for vector search, RAG, etc. · ☆65 · Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs · ☆77 · Updated 10 months ago
- Chat Markup Language conversation library · ☆55 · Updated last year
- Lightweight tools for quick and easy LLM demos · ☆26 · Updated 4 months ago
- Fast approximate inference on a single GPU with sparsity-aware offloading · ☆38 · Updated last year