emrgnt-cmplxty / SmolTrainerLinks

☆20

Alternatives and similar repositories for SmolTrainer

Users that are interested in SmolTrainer are comparing it to the libraries listed below

Sorting:

emrgnt-cmplxty / zero-shot-replication
☆74Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Updated last year
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71Updated last year
arcee-ai / DAM
☆53Updated 8 months ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆26Updated 9 months ago
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆175Updated last year
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31Updated last year
discus-labs / discus
A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ
☆63Updated last year
TheBlokeAI / AIScripts
Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub
☆162Updated last year
teknium1 / transformers-gptq-quant
☆47Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105Updated 7 months ago
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 9 months ago
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆67Updated last year
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91Updated 6 months ago
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated last year
rawsh / mirrorllm
☆17Updated 6 months ago
geronimi73 / phi2-finetune
☆87Updated last year
kerekovskik / autologic
autologic is a Python package that implements the SELF-DISCOVER framework proposed in the paper SELF-DISCOVER: Large Language Models Self…
☆60Updated last year
nyunAI / PruneGPT
☆51Updated last year
huu4ontocord / MDEL
Multi-Domain Expert Learning
☆67Updated last year
QuixiAI / kraken
☆66Updated last year
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆63Updated 11 months ago
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆78Updated last year
teknium1 / ShareGPT-Builder
☆115Updated 7 months ago
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated last year