git-cloner / llama-lora-fine-tuningLinks
llama fine-tuning with lora
β139Updated last year
Alternatives and similar repositories for llama-lora-fine-tuning
Users that are interested in llama-lora-fine-tuning are comparing it to the libraries listed below
Sorting:
- π An unofficial implementation of Self-Alignment with Instruction Backtranslation.β140Updated 2 months ago
- llama2 finetuning with deepspeed and loraβ175Updated last year
- Naive Bayes-based Context Extensionβ326Updated 7 months ago
- Large Language Models Are Reasoning Teachers (ACL 2023)β339Updated 4 months ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Modelsβ209Updated last year
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"β209Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chatβ114Updated 2 years ago
- β96Updated last year
- YuLan-IR: Information Retrieval Boosted LMsβ222Updated last year
- β324Updated last year
- Data and Code for Program of Thoughts [TMLR 2023]β279Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDINGβ87Updated last year
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.β406Updated 6 months ago
- Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so onβ97Updated last year
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Humanβ¦β218Updated last year
- a Fine-tuned LLaMA that is Good at Arithmetic Tasksβ178Updated last year
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Themβ501Updated last year
- A paper & resource list of large language models, including course, paper, demo, figuresβ199Updated last year
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmarkβ102Updated last year
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRAβ218Updated 2 years ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]β558Updated 7 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuningβ263Updated last year
- A large-scale, fine-grained, diverse preference dataset (and models).β343Updated last year
- Generative Judge for Evaluating Alignmentβ244Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Modelsβ266Updated 10 months ago
- β144Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructionsβ172Updated 2 years ago
- Collaborative Training of Large Language Models in an Efficient Wayβ416Updated 10 months ago
- Official implementation of paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)β294Updated 9 months ago
- Paper collection on building and evaluating language model agents via executable language groundingβ356Updated last year