fairydreaming / farel-benchLinks

Testing LLM reasoning abilities with family relationship quizzes.

☆62

Alternatives and similar repositories for farel-bench

Users that are interested in farel-bench are comparing it to the libraries listed below

Sorting:

thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆178Updated last year
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆145Updated 7 months ago
QuixiAI / grokadamw
☆136Updated last year
jukofyork / transplant-vocab
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆42Updated last month
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
QuixiAI / spectrum
☆136Updated last month
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆106Updated last year
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆93Updated 5 months ago
uukuguy / multi_loras
Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…
☆158Updated last year
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆201Updated last year
rafacelente / bllama
1.58-bit LLaMa model
☆83Updated last year
tdrussell / qlora-pipe
A pipeline parallel training script for LLMs.
☆157Updated 5 months ago
teknium1 / ShareGPT-Builder
☆115Updated 9 months ago
epolewski / EricLLM
A fast batching API to serve LLM models
☆187Updated last year
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆90Updated 4 months ago
QuixiAI / kraken
☆67Updated last year
Pints-AI / 1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
☆330Updated 6 months ago
serp-ai / unsloth
5X faster 60% less memory QLoRA finetuning
☆21Updated last year
TheProxyCompany / proxy-structuring-engine
Guaranteed Structured Output from any Language Model via Hierarchical State Machines
☆145Updated this week
astramind-ai / BitMat
An efficent implementation of the method proposed in "The Era of 1-bit LLMs"
☆154Updated 11 months ago
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆27Updated 11 months ago
LostRuins / datasetexplorer
Easily view and modify JSON datasets for large language models
☆83Updated 4 months ago
keeeeenw / MicroLlama
Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget
☆161Updated 2 months ago
sam-paech / antislop-sampler
☆315Updated 2 months ago
matt-c1 / llama-3-quant-comparison
Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.
☆165Updated last year
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 9 months ago
nyunAI / PruneGPT
☆51Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆93Updated 3 weeks ago
nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆68Updated last year
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆231Updated 11 months ago