explodinggradients / Funtuner

Supervised instruction finetuning for LLMs with HF Trainer and DeepSpeed

☆36

Alternatives and similar repositories for Funtuner

Users who are interested in Funtuner are comparing it to the libraries listed below.

explodinggradients / nemesis
Reward Model framework for LLM RLHF
☆61 Updated 2 years ago
weaviate-tutorials / Hurricane
Writing Blog Posts with Generative Feedback Loops!
☆50 Updated last year
krypticmouse / matryoshka-representation-learning
PyTorch implementation for MRL
☆19 Updated last year
pacman100 / peft-codegen-25
☆23 Updated 2 years ago
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33 Updated last month
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69 Updated 2 years ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆51 Updated 9 months ago
geronimi73 / phi2-finetune
☆86 Updated last year
davanstrien / data-for-fine-tuning-llms
☆80 Updated last year
v-prgmr / mergekit
Tools for merging pretrained large language models.
☆19 Updated last year
hamelsmu / llama-inference
experiments with inference on llama
☆103 Updated last year
Arize-ai / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆106 Updated last month
lancedb / ragged
☆21 Updated last year
teknium1 / transformers-gptq-quant
☆45 Updated 2 years ago
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆50 Updated last year
Muhtasham / summarization-eval
📝 Reference-Free automatic summarization evaluation with potential hallucination detection
☆102 Updated last year
unicamp-dl / InRanker
☆48 Updated last year
hamelsmu / ft-drift
Check for data drift between two OpenAI multi-turn chat jsonl files.
☆38 Updated last year
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37 Updated 2 years ago
PrithivirajDamodaran / SPLADERunner
Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…
☆33 Updated last year
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆69 Updated last year
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆72 Updated last year
lightonai / pylate-rs
PyLate efficient inference engine
☆66 Updated 2 months ago
muellerzr / nbdistributed
Seamless interface for using PyTorch distributed with Jupyter notebooks
☆53 Updated 2 months ago
dm4ml / motion
Framework for building and maintaining self-updating prompts for LLMs
☆64 Updated last year
titanml / takeoff-community
TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…
☆114 Updated last year
MantisAI / hugie
Command Line Interface for Hugging Face Inference Endpoints
☆66 Updated last year
Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆59 Updated last year
Alignment-Lab-AI / datagen
A pipeline for using API calls to agnostically convert unstructured data into structured training data
☆31 Updated last year
NielsRogge / awesome-huggingface
Repository containing awesome resources regarding Hugging Face tooling.
☆48 Updated last year