Alignment-Lab-AI / datagen

A pipeline for using API calls to agnostically convert unstructured data into structured training data

☆32

Alternatives and similar repositories for datagen

Users who are interested in datagen are comparing it to the libraries listed below.

teknium1 / transformers-gptq-quant
☆45Updated 2 years ago
CarperAI / treasure_trove
☆22Updated 2 years ago
pacman100 / peft-codegen-25
☆23Updated 2 years ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆58Updated last month
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆83Updated 2 years ago
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆70Updated 2 years ago
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆110Updated 11 months ago
BBischof / yapping
Verbosity control for AI agents
☆64Updated last year
geronimi73 / phi2-finetune
☆86Updated last year
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆51Updated 9 months ago
NousResearch / StripedHyenaTrainer
☆62Updated last year
arcee-ai / DAM
☆55Updated last year
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆49Updated 2 years ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 9 months ago
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated 2 years ago
krypticmouse / matryoshka-representation-learning
PyTorch implementation for MRL
☆20Updated last year
modal-labs / ci-on-modal
A sample pattern for running CI tests on Modal
☆18Updated 7 months ago
Arize-ai / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆106Updated 2 months ago
ChrisHayduk / QLoRA-for-MLM
QLoRA for Masked Language Modeling
☆22Updated 2 years ago
stunningpixels / lou-eval
Track the progress of LLM context utilisation
☆55Updated 7 months ago
teknium1 / LLM-Logbook
Public reports detailing responses to sets of prompts by Large Language Models.
☆32Updated 10 months ago
huu4ontocord / MDEL
Multi-Domain Expert Learning
☆67Updated last year
JoshuaPurtell / SmallBench
Small, simple agent task environments for training and evaluation
☆19Updated last year
explodinggradients / Funtuner
Supervised instruction finetuning for LLM with HF trainer and Deepspeed
☆36Updated 2 years ago
luyug / magix
Supercharge huggingface transformers with model parallelism.
☆77Updated 4 months ago
argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Updated last year
abacaj / train-with-fsdp
☆94Updated 2 years ago
interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆40Updated last year