rasbt / low-rank-adaptation-blogLinks

☆29

Alternatives and similar repositories for low-rank-adaptation-blog

Users that are interested in low-rank-adaptation-blog are comparing it to the libraries listed below

Sorting:

tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆43Updated last year
explodinggradients / nemesis
Reward Model framework for LLM RLHF
☆61Updated 2 years ago
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆91Updated last year
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆168Updated last year
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆78Updated 9 months ago
Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆59Updated last year
CarperAI / InstructGPT
For experiments involving instruct gpt. Currently used for documenting open research questions.
☆71Updated 2 years ago
EleutherAI / stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
☆83Updated last year
hamelsmu / llama-inference
experiments with inference on llama
☆104Updated last year
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆116Updated 2 years ago
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29Updated 6 months ago
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34Updated 2 years ago
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71Updated last year
EleutherAI / semantic-memorization
☆44Updated 8 months ago
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105Updated 8 months ago
LLM360 / Analysis360
Open Implementations of LLM Analyses
☆106Updated 10 months ago
allenai / catwalk
This project studies the performance and robustness of language models and task-adaptation methods.
☆150Updated last year
geronimi73 / phi2-finetune
☆88Updated last year
austrian-code-wizard / c3po
☆29Updated last week
allenai / CommonGen-Eval
Evaluating LLMs with CommonGen-Lite
☆90Updated last year
microsoft / SafeNLP
Safety Score for Pre-Trained Language Models
☆95Updated last year
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆269Updated last year
KaiNylund / lm-weights-encode-time
☆69Updated last year
SALT-NLP / demonstrated-feedback
☆125Updated 10 months ago
kernelmachine / cbtm
Code repository for the c-BTM paper
☆107Updated last year
Anni-Zou / Meta-CoT
Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
☆97Updated last year
LLM360 / crystalcoder-train
Pre-training code for CrystalCoder 7B LLM
☆55Updated last year
sayakpaul / count-tokens-hf-datasets
This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…
☆27Updated 2 years ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆93Updated 2 years ago
sunyt32 / torchscale
Transformers at any scale
☆41Updated last year