cindysridykhan / instruct_storyteller_tinyllama2Links

Training and Fine-tuning an llm in Python and PyTorch.

☆42

Alternatives and similar repositories for instruct_storyteller_tinyllama2

Users that are interested in instruct_storyteller_tinyllama2 are comparing it to the libraries listed below

Sorting:

keeeeenw / MicroLlama
Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget
☆153Updated last year
geronimi73 / phi2-finetune
☆87Updated last year
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆91Updated last year
IST-DASLab / qmoe
Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".
☆277Updated last year
LLM360 / amber-train
Pre-training code for Amber 7B LLM
☆166Updated last year
aju22 / LLaMA2
This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)…
☆68Updated last year
hamelsmu / llama-inference
experiments with inference on llama
☆104Updated last year
swj0419 / detect-pretrain-code-contamination
☆76Updated last year
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 5 months ago
golololologol / LLM-Distillery
A pipeline for LLM knowledge distillation
☆105Updated 3 months ago
dust-tt / llama-ssp
Experiments on speculative sampling with Llama models
☆128Updated 2 years ago
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71Updated last year
thepowerfuldeez / OLMo
My fork os allen AI's OLMo for educational purposes.
☆30Updated 7 months ago
rasbt / pytorch-memory-optim
This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…
☆92Updated 2 years ago
terryyz / llm-benchmark
A list of LLM benchmark frameworks.
☆68Updated last year
SeunghyunSEO / optimized_hf_llama_class_for_training
☆48Updated 10 months ago
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆198Updated last year
promptslab / LLMtuner
FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)
☆240Updated last year
TheBlokeAI / AIScripts
Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub
☆162Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated last year
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆77Updated 8 months ago
jondurbin / bagel
A bagel, with everything.
☆322Updated last year
hydrallm / llama-moe-v1
☆95Updated last year
imagination-research / sot
[ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation
☆171Updated last year
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆168Updated last year
qwopqwop200 / gptqlora
GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ
☆102Updated 2 years ago
abacaj / train-with-fsdp
☆92Updated last year
EmbeddedLLM / vllm
vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
☆87Updated this week
Pints-AI / 1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
☆318Updated 3 months ago
Digitous / LLM-SLERP-Merge
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆132Updated last year