QLoRA for Masked Language Modeling
☆23Sep 11, 2023Updated 2 years ago
Alternatives and similar repositories for QLoRA-for-MLM
Users that are interested in QLoRA-for-MLM are comparing it to the libraries listed below
Sorting:
- Generating Protein Variants with Different Generative Models (HMM, VAE, ESM-2, ProtGPT2)☆11Mar 14, 2024Updated last year
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 5 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- Karpathy's llama2.c transpiled to MLX for Apple Silicon☆14Dec 28, 2023Updated 2 years ago
- Arabic edition of ALBERT pretrained language models☆16Apr 25, 2021Updated 4 years ago
- ☆16Feb 14, 2025Updated last year
- Mixtral finetuning☆19Feb 2, 2024Updated 2 years ago
- Tokun to can tokens☆18Jun 19, 2025Updated 8 months ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- ☆21Oct 22, 2024Updated last year
- ☆22Aug 27, 2023Updated 2 years ago
- ☆25May 7, 2025Updated 10 months ago
- ☆27Dec 13, 2024Updated last year
- Generate textbook-quality synthetic LLM pretraining data☆509Oct 19, 2023Updated 2 years ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆64Feb 19, 2026Updated 2 weeks ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆224Dec 16, 2025Updated 2 months ago
- GPT2 fine-tuning pipeline with KerasNLP, TensorFlow, and TensorFlow Extended☆33Sep 6, 2023Updated 2 years ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Jan 7, 2026Updated last month
- some common Huggingface transformers in maximal update parametrization (µP)☆87Mar 14, 2022Updated 3 years ago
- ☆18Dec 5, 2024Updated last year
- R Shiny application "Vaccine Designer" aiming for the construction of vaccine sequences based on multi epitope design workflow.☆10Jun 25, 2024Updated last year
- Swete's LXX Text from 1KY Greek with Corrections Against Manuscripts☆10Oct 11, 2020Updated 5 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Convert all of libgen to high quality markdown☆255Dec 13, 2023Updated 2 years ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Jan 4, 2024Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Feb 9, 2023Updated 3 years ago
- An open source project on estimating train delays in India.☆11Oct 29, 2018Updated 7 years ago
- State-of-the-art neural tagger and lemmatizer for ancient languages☆13Mar 9, 2025Updated 11 months ago
- Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference☆45Nov 28, 2022Updated 3 years ago
- Simple repository for training small reasoning models☆49Feb 17, 2026Updated 2 weeks ago
- Shared MuJoCo simulation scenes and assets for ROBEL environments.☆14Jul 31, 2020Updated 5 years ago
- Conversion of audio files to text using whisper from OpenAI with a simple tkinter GUI☆10Apr 13, 2023Updated 2 years ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 5 years ago
- Supporting code for Deep Neural Network or Dermatologist?☆10Nov 14, 2019Updated 6 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 3 years ago
- Code for improving the performance of sequence-to-expression models for making individual-specific gene expression predictions by fine-tu…☆15Dec 12, 2025Updated 2 months ago
- ☆18Jun 25, 2025Updated 8 months ago
- LLM Building Blocks for Python Course☆16Nov 17, 2025Updated 3 months ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 7 months ago