ChrisHayduk / QLoRA-for-MLMLinks

QLoRA for Masked Language Modeling

☆22

Alternatives and similar repositories for QLoRA-for-MLM

Users that are interested in QLoRA-for-MLM are comparing it to the libraries listed below

Sorting:

ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated 2 years ago
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆35Updated 2 years ago
rwightman / genalog
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…
☆44Updated last year
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆52Updated 9 months ago
Knowledgator / FlashDeBERTa
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆67Updated 2 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 9 months ago
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆49Updated 2 years ago
Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆59Updated last year
CarperAI / treasure_trove
☆22Updated 2 years ago
argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Updated last year
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆32Updated 2 months ago
teknium1 / transformers-gptq-quant
☆45Updated 2 years ago
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆38Updated 2 years ago
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆110Updated 11 months ago
KaiNylund / lm-weights-encode-time
☆69Updated last year
krypticmouse / matryoshka-representation-learning
PyTorch implementation for MRL
☆20Updated last year
arcee-ai / DAM
☆55Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆70Updated 2 years ago
huu4ontocord / MDEL
Multi-Domain Expert Learning
☆67Updated last year
pacman100 / peft-codegen-25
☆23Updated 2 years ago
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆28Updated last year
Zyphra / Zyda_processing
☆39Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated 2 years ago
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆79Updated last year
mobarski / alpaca-libre
Reimplementation of the task generation part from the Alpaca paper
☆119Updated 2 years ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated last year
Alignment-Lab-AI / datagen
a pipeline for using api calls to agnostically convert unstructured data into structured training data
☆32Updated last year
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75Updated last year
unicamp-dl / InRanker
☆48Updated last year
luyug / magix
Supercharge huggingface transformers with model parallelism.
☆77Updated 4 months ago