ChrisHayduk / QLoRA-for-MLMView external linksLinks
QLoRA for Masked Language Modeling
☆22Sep 11, 2023Updated 2 years ago
Alternatives and similar repositories for QLoRA-for-MLM
Users that are interested in QLoRA-for-MLM are comparing it to the libraries listed below
Sorting:
- Generating Protein Variants with Different Generative Models (HMM, VAE, ESM-2, ProtGPT2)☆11Mar 14, 2024Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 4 months ago
- Karpathy's llama2.c transpiled to MLX for Apple Silicon☆14Dec 28, 2023Updated 2 years ago
- Arabic edition of ALBERT pretrained language models☆16Apr 25, 2021Updated 4 years ago
- ☆22Aug 27, 2023Updated 2 years ago
- ☆25May 7, 2025Updated 9 months ago
- Generate textbook-quality synthetic LLM pretraining data☆509Oct 19, 2023Updated 2 years ago
- Few-shot Learning with Auxiliary Data☆31Dec 8, 2023Updated 2 years ago
- This repository contain the simple llama3 implementation in pure jax.☆71Feb 17, 2025Updated 11 months ago
- LTG-Bert☆34Jan 8, 2024Updated 2 years ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆224Dec 16, 2025Updated last month
- GPT2 fine-tuning pipeline with KerasNLP, TensorFlow, and TensorFlow Extended☆33Sep 6, 2023Updated 2 years ago
- some common Huggingface transformers in maximal update parametrization (µP)☆87Mar 14, 2022Updated 3 years ago
- A guidance compatibility layer for llama-cpp-python☆36Sep 11, 2023Updated 2 years ago
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution☆35Aug 2, 2023Updated 2 years ago
- R Shiny application "Vaccine Designer" aiming for the construction of vaccine sequences based on multi epitope design workflow.☆10Jun 25, 2024Updated last year
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- ☆18Dec 5, 2024Updated last year
- Convert all of libgen to high quality markdown☆254Dec 13, 2023Updated 2 years ago
- ☆42Jun 19, 2024Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Jan 4, 2024Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Feb 9, 2023Updated 3 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 2 years ago
- Shared MuJoCo simulation scenes and assets for ROBEL environments.☆14Jul 31, 2020Updated 5 years ago
- Simple repository for training small reasoning models☆49Feb 6, 2025Updated last year
- Code for improving the performance of sequence-to-expression models for making individual-specific gene expression predictions by fine-tu…☆15Dec 12, 2025Updated 2 months ago
- Predicting train delay using previous weather and delay information of trains☆11Oct 25, 2017Updated 8 years ago
- LLM Building Blocks for Python Course☆15Nov 17, 2025Updated 2 months ago
- This is for the AI enzyme design course☆13Nov 10, 2025Updated 3 months ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 6 months ago
- Persistent memory system for agentic AI via MCP - remember, recall, forget with semantic search with knowledge graph☆24Updated this week
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 2 years ago
- Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages☆11Jan 1, 2023Updated 3 years ago
- Conversion of audio files to text using whisper from OpenAI with a simple tkinter GUI☆10Apr 13, 2023Updated 2 years ago
- An open source project on estimating train delays in India.☆11Oct 29, 2018Updated 7 years ago
- ☆18Jun 25, 2025Updated 7 months ago
- PSI-MOD ontology for modified and unmodified amino acid residues☆14Jan 8, 2026Updated last month
- Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference☆45Nov 28, 2022Updated 3 years ago