austinsilveria / tricksy

Fast approximate inference on a single GPU with sparsity-aware offloading

☆38

Alternatives and similar repositories for tricksy

Users who are interested in tricksy are comparing it to the libraries listed below.

official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69 · Updated 2 years ago
mzbac / mlx-moe
Scripts to create your own MoE models using MLX
☆90 · Updated last year
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆58 · Updated last week
arcee-ai / DAM
☆55 · Updated 11 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59 · Updated 8 months ago
sdan / selfextend
An implementation of Self-Extend, which expands the context window via grouped attention
☆118 · Updated last year
SebastianBodza / EnsembleForecasting
Using multiple LLMs for ensemble forecasting
☆16 · Updated last year
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45 · Updated last year
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22 · Updated 11 months ago
Cerebras / DocChat
GPT-4-level conversational QA trained in a few hours
☆65 · Updated last year
thooton / muse
Let's create synthetic textbooks together :)
☆75 · Updated last year
QuixiAI / kraken
☆67 · Updated last year
geronimi73 / phi2-finetune
☆88 · Updated last year
ChrisHayduk / qlora-multi-gpu
QLoRA with enhanced multi-GPU support
☆37 · Updated 2 years ago
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.
☆35 · Updated last year
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13 · Updated last year
tval2 / contextual-pruning
Library to facilitate pruning of LLMs based on context
☆32 · Updated last year
EduardTalianu / EntropixLab
Entropix-style sampling + GUI
☆27 · Updated 11 months ago
joey00072 / ohara
Collection of autoregressive model implementations
☆86 · Updated 6 months ago
nyunAI / PruneGPT
☆51 · Updated last year
deepshard / mixtral-8x7b-Inference
Eh, simple and works.
☆27 · Updated last year
reka-ai / rekaquant
☆62 · Updated 3 months ago
emrgnt-cmplxty / zero-shot-replication
☆73 · Updated 2 years ago
iulia-b10 / multilingual-embedding-models
☆20 · Updated last year
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆49 · Updated 8 months ago
vikhyat / mixtral-inference
Inference code for mixtral-8x7b-32kseqlen
☆102 · Updated last year
VikParuchuri / classified
Score LLM pretraining data with classifiers
☆54 · Updated last year
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆67 · Updated last year
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31 · Updated last year
QuixiAI / grokadamw
☆136 · Updated last year