joey00072 / TinyLora
Low-Rank Adaptation of Large Language Models clean implementation
★8 · Updated last year
Alternatives and similar repositories for TinyLora
Users who are interested in TinyLora are comparing it to the libraries listed below.
- A collection of templates for Hugging Face Spaces ★35 · Updated last year
- ★22 · Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given… ★14 · Updated last year
- ★23 · Updated last year
- ★43 · Updated 2 years ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations ★33 · Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ★69 · Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ★34 · Updated last year
- ★38 · Updated last year
- A library for squeakily cleaning and filtering language datasets. ★47 · Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping. ★25 · Updated 2 years ago
- Simple Model Similarities Analysis ★21 · Updated last year
- Implementation of a holodeck, written in Pytorch ★18 · Updated last year
- Jax like function transformation engine but micro, microjax ★32 · Updated 8 months ago
- ★23 · Updated 6 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models ★45 · Updated last year
- Tools for merging pretrained large language models. ★19 · Updated last year
- Training and Inference Notebooks for the RedPajama (OpenLlama) models ★18 · Updated 2 years ago
- Trade any tensors over the network ★30 · Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ★63 · Updated 2 years ago
- Example for Logging LLM Evaluator Prompt Responses ★16 · Updated last year
- Describe the format of image/text datasets ★11 · Updated 3 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ★24 · Updated last year
- A sample pattern for running CI tests on Modal ★18 · Updated 2 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ★21 · Updated 10 months ago
- ★35 · Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ★32 · Updated last year
- ★17 · Updated last year
- ★13 · Updated 2 years ago
- LLM attention pattern visualizer ★10 · Updated last year