unslothai / unsloth-zoo
Utils for Unsloth
☆73 · Updated last week
Alternatives and similar repositories for unsloth-zoo:
Users interested in unsloth-zoo are comparing it to the libraries listed below.
- ☆113 · Updated 2 weeks ago
- Train, tune, and infer Bamba model ☆88 · Updated this week
- ☆53 · Updated 10 months ago
- A toolkit for fine-tuning, running inference with, and evaluating GreenBitAI's LLMs. ☆82 · Updated last month
- Easy-to-use, high-performance knowledge distillation for LLMs ☆60 · Updated this week
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆198 · Updated 9 months ago
- EvolKit is a framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Models. ☆213 · Updated 5 months ago
- An efficient implementation of the method proposed in "The Era of 1-bit LLMs" ☆154 · Updated 6 months ago
- Set of scripts to fine-tune LLMs ☆37 · Updated last year
- Collection of autoregressive model implementations ☆85 · Updated this week
- A pipeline for LLM knowledge distillation ☆100 · Updated 3 weeks ago
- Unsloth Studio ☆79 · Updated 3 weeks ago
- PyTorch implementation of models from the Zamba2 series. ☆179 · Updated 3 months ago
- ☆129 · Updated 8 months ago
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluations. ☆41 · Updated last month
- ☆48 · Updated 5 months ago
- Train your own SOTA deductive reasoning model ☆88 · Updated last month
- Google TPU optimizations for transformers models ☆108 · Updated 3 months ago
- ☆117 · Updated 8 months ago
- Repo hosting code and materials related to speeding up LLM inference using token merging. ☆36 · Updated 11 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024) ☆152 · Updated 2 weeks ago
- ☆75 · Updated last year
- Data preparation code for Amber 7B LLM ☆88 · Updated 11 months ago
- My fork of Allen AI's OLMo for educational purposes. ☆30 · Updated 4 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆27 · Updated last year
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton. ☆127 · Updated last week
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆76 · Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆262 · Updated 6 months ago
- Simple GRPO scripts and configurations. ☆58 · Updated 2 months ago
- Minimal GRPO implementation from scratch ☆85 · Updated last month