cbh123 / llmboxing

LLM boxing matches

☆57

Alternatives and similar repositories for llmboxing

Users who are interested in llmboxing are comparing it to the repositories listed below.

QuixiAI / kraken
☆66 Updated last year
stunningpixels / lou-eval
Track the progress of LLM context utilisation
☆55 Updated 3 months ago
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119 Updated last year
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆175 Updated last year
NousResearch / StripedHyenaTrainer
☆61 Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119 Updated last year
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37 Updated last year
teknium1 / ShareGPT-Builder
☆115 Updated 7 months ago
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105 Updated 7 months ago
geronimi73 / phi2-finetune
☆87 Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94 Updated last year
emrgnt-cmplxty / zero-shot-replication
☆74 Updated last year
reactorsh / ambrosia
clean up your LLM datasets
☆115 Updated 2 years ago
arcee-ai / DAM
☆53 Updated 8 months ago
euclaise / supertrainer2000
☆49 Updated last year
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆81 Updated 2 months ago
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆168 Updated last year
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆78 Updated last year
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆80 Updated last year
Mihaiii / llm_steer
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆240 Updated 5 months ago
cg123 / bitnet
Modeling code for a BitNet b1.58 Llama-style model.
☆25 Updated last year
QuixiAI / grokadamw
☆134 Updated 11 months ago
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆91 Updated last year
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71 Updated last year
ZeroSumEval / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆32 Updated 3 months ago
CERC-AAI / Robin
☆63 Updated 10 months ago
Zyphra / Zyda_processing
☆37 Updated last year
Digitous / LLM-SLERP-Merge
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆135 Updated last year
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60 Updated 11 months ago
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆65 Updated last year