teknium1/LLM-Benchmark-Logs

Just a bunch of benchmark logs for different LLMs

☆130

Alternatives and similar repositories for LLM-Benchmark-Logs

Users who are interested in LLM-Benchmark-Logs are comparing it to the repositories listed below.

teknium1 / LLM-Logbook
Public reports detailing responses to sets of prompts by Large Language Models.
☆40 · Jan 4, 2025 · Updated last year
teknium1 / transformers-gptq-quant
☆46 · Oct 13, 2023 · Updated 2 years ago
argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24 · Mar 12, 2024 · Updated 2 years ago
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82 · Sep 10, 2023 · Updated 2 years ago
lumpenspace / FRAG
Flexible, efficient, and context-aware generation from large unstructured knowledge sources.
☆17 · May 7, 2024 · Updated 2 years ago
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆43 · Feb 15, 2024 · Updated 2 years ago
agnt-gg / rizz
The place for RIZZ
☆12 · Mar 10, 2025 · Updated last year
Datura-ai / cortex.t
☆63 · Apr 12, 2026 · Updated 3 months ago
brendanhogan / completion_tree_view
☆15 · Apr 26, 2025 · Updated last year
CarperAI / treasure_trove
☆21 · Aug 27, 2023 · Updated 2 years ago
hamelsmu / replicate-examples
☆21 · Apr 29, 2024 · Updated 2 years ago
Re-Align / AlignTDS
Analyzing LLM Alignment via Token distribution shift
☆17 · Jan 26, 2024 · Updated 2 years ago
mlabonne / llm-autoeval
Automatically evaluate your LLMs in Google Colab
☆695 · May 7, 2024 · Updated 2 years ago
lunary-ai / llm-benchmarks
LLM benchmarks
☆13 · Feb 22, 2024 · Updated 2 years ago
mbzuai-nlp / LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆822 · May 6, 2023 · Updated 3 years ago
teknium1 / RawTransform
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆34 · May 29, 2023 · Updated 3 years ago
QuixiAI / OpenChatML
☆166 · Aug 8, 2025 · Updated 11 months ago
tokenbender / avataRL
rl from zero pretrain, can it be done? yes.
☆295 · Sep 28, 2025 · Updated 9 months ago
IST-DASLab / qmoe
Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".
☆278 · Nov 3, 2023 · Updated 2 years ago
deployradiant / pychatml
Chat Markup Language conversation library
☆55 · Jan 3, 2024 · Updated 2 years ago
sabetAI / BLoRA
batched loras
☆350 · Sep 6, 2023 · Updated 2 years ago
sam-paech / antislop-sampler
☆350 · Mar 5, 2026 · Updated 4 months ago
matttreed / diloco-sim
☆23 · Jan 5, 2025 · Updated last year
taylorai / galactic
data cleaning and curation for unstructured text
☆329 · Aug 6, 2024 · Updated last year
kevinyaobytedance / llm_eval
LLM evaluation.
☆16 · Nov 7, 2023 · Updated 2 years ago
jondurbin / bagel
A bagel, with everything.
☆326 · Apr 11, 2024 · Updated 2 years ago
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69 · Aug 27, 2023 · Updated 2 years ago
Shark-NLP / EVALM
Official codebase for “In-Context Learning with Many Demonstration Examples”
☆16 · Feb 13, 2023 · Updated 3 years ago
joey00072 / microjax
Jax like function transformation engine but micro, microjax
☆34 · Oct 25, 2024 · Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119 · Jan 7, 2024 · Updated 2 years ago
joey00072 / Tinytorch
A really tiny autograd engine
☆100 · May 26, 2025 · Updated last year
UpstageAI / evalverse-IFEval
Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…
☆15 · May 4, 2024 · Updated 2 years ago
reactorsh / ambrosia
clean up your LLM datasets
☆113 · May 30, 2023 · Updated 3 years ago
mistralai / megablocks-public
☆865 · Dec 8, 2023 · Updated 2 years ago
rosmineb / unit_test_rl
Project code for training LLMs to write better unit tests + code
☆22 · May 19, 2025 · Updated last year
geronimi73 / phi2-finetune
☆85 · Feb 1, 2024 · Updated 2 years ago
SidU / MathBlackBox
☆11 · Jul 21, 2024 · Updated 2 years ago
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆3,342 · Updated this week
teknium1 / GPTeacher
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
☆1,668 · Sep 15, 2023 · Updated 2 years ago