kevinwu23 / StanfordFineTuneBenchLinks

☆31

Alternatives and similar repositories for StanfordFineTuneBench

Users that are interested in StanfordFineTuneBench are comparing it to the libraries listed below

Sorting:

AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆51Updated 9 months ago
Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆59Updated last year
Knowledgator / FlashDeBERTa
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆66Updated last month
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated last year
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 9 months ago
AnswerDotAI / fastkmeans
☆86Updated 4 months ago
arcee-ai / DAM
☆55Updated last year
rwightman / genalog
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…
☆43Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆109Updated 11 months ago
SeunghyunSEO / optimized_hf_llama_class_for_training
☆48Updated last year
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 11 months ago
facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆100Updated last week
allenai / infinigram-api
☆81Updated last week
Aleph-Alpha-Research / trigrams
☆57Updated last month
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆58Updated last month
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33Updated 2 months ago
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆50Updated last year
YuchenJin / llm.c
LLM training in simple, raw C/CUDA
☆15Updated 11 months ago
pacman100 / peft-codegen-25
☆23Updated 2 years ago
QuixiAI / spectrum
☆138Updated 2 months ago
luyug / magix
Supercharge huggingface transformers with model parallelism.
☆77Updated 3 months ago
jxmorris12 / bm25_pt
minimal pytorch implementation of bm25 (with sparse tensors)
☆104Updated 3 weeks ago
krypticmouse / matryoshka-representation-learning
PyTorch implementation for MRL
☆19Updated last year
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆72Updated last year
chainyo / tensorshare
🤝 Trade any tensors over the network
☆30Updated 2 years ago
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆35Updated 2 years ago
stephantul / skeletoken
Datamodels for hugging face tokenizers
☆86Updated 2 weeks ago
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆276Updated last year
AnswerDotAI / fastdata
☆159Updated 11 months ago
muellerzr / nbdistributed
Seemless interface of using PyTOrch distributed with Jupyter notebooks
☆53Updated 2 months ago