git-cloner / llama-lora-fine-tuning
llama fine-tuning with lora
β140Updated 8 months ago
Alternatives and similar repositories for llama-lora-fine-tuning:
Users that are interested in llama-lora-fine-tuning are comparing it to the libraries listed below
- llama2 finetuning with deepspeed and loraβ171Updated last year
- π An unofficial implementation of Self-Alignment with Instruction Backtranslation.β136Updated 6 months ago
- Code for "Lion: Adversarial Distillation of Proprietary Large Language Models (EMNLP 2023)"β204Updated 11 months ago
- Generative Judge for Evaluating Alignmentβ223Updated last year
- A large-scale, fine-grained, diverse preference dataset (and models).β325Updated last year
- YuLan-IR: Information Retrieval Boosted LMsβ218Updated 10 months ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]β526Updated last month
- β93Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chatβ109Updated last year
- FireAct: Toward Language Agent Fine-tuningβ261Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDINGβ88Updated 9 months ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Humanβ¦β210Updated 8 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuningβ236Updated last year
- β137Updated 6 months ago
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.β388Updated 3 weeks ago
- β128Updated 9 months ago
- A Multi-Turn Dialogue Corpus based on Alpaca Instructionsβ165Updated last year
- β159Updated last year
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other moβ¦β331Updated 4 months ago
- Large Language Models Are Reasoning Teachers (ACL 2023)β312Updated last year
- β268Updated last year
- The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>β329Updated 8 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.β231Updated 2 months ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels β¦β249Updated last year
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"β205Updated last year
- Naive Bayes-based Context Extensionβ320Updated last month
- β305Updated 6 months ago
- All available datasets for Instruction Tuning of Large Language Modelsβ240Updated last year
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuningβ403Updated 3 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"β113Updated 7 months ago