tcapelle / mixtralLinks

Mixtral finetuning

☆19

Alternatives and similar repositories for mixtral

Users that are interested in mixtral are comparing it to the libraries listed below

Sorting:

ChrisHayduk / QLoRA-for-MLM
QLoRA for Masked Language Modeling
☆22Updated last year
Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆59Updated last year
rwightman / genalog
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…
☆42Updated last year
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆48Updated 5 months ago
pacman100 / peft-codegen-25
☆23Updated 2 years ago
geronimi73 / phi2-finetune
☆87Updated last year
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated last year
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
CarperAI / treasure_trove
☆22Updated last year
huggingface / wikirace-llms
☆23Updated 2 months ago
argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Updated last year
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 5 months ago
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated last year
arcee-ai / DAM
☆52Updated 8 months ago
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆47Updated 2 years ago
huu4ontocord / MDEL
Multi-Domain Expert Learning
☆67Updated last year
Knowledgator / FlashDeBERTa
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆62Updated 2 months ago
teknium1 / transformers-gptq-quant
☆47Updated last year
krypticmouse / matryoshka-representation-learning
PyTorch implementation for MRL
☆19Updated last year
jxmorris12 / bm25_pt
minimal pytorch implementation of bm25 (with sparse tensors)
☆102Updated last year
trapoom555 / Language-Model-STS-CFT
Improving Text Embedding of Language Models Using Contrastive Fine-tuning
☆64Updated 11 months ago
enjalot / latent-sae
Training code for Sparse Autoencoders on Embedding models
☆38Updated 4 months ago
Zyphra / Zyda_processing
☆36Updated last year
KaiNylund / lm-weights-encode-time
☆68Updated 11 months ago
luyug / magix
Supercharge huggingface transformers with model parallelism.
☆77Updated 9 months ago
kevinwu23 / StanfordFineTuneBench
☆30Updated 8 months ago
AnswerDotAI / fastkmeans
☆62Updated last week
Aleph-Alpha-Research / trigrams
☆56Updated 2 months ago
YuchenJin / llm.c
LLM training in simple, raw C/CUDA
☆15Updated 7 months ago
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75Updated 8 months ago