git-cloner / llama-lora-fine-tuning
llama fine-tuning with lora
☆139Updated 11 months ago
Alternatives and similar repositories for llama-lora-fine-tuning:
Users that are interested in llama-lora-fine-tuning are comparing it to the libraries listed below
- llama2 finetuning with deepspeed and lora☆174Updated last year
- Naive Bayes-based Context Extension☆326Updated 4 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆139Updated 10 months ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆115Updated last year
- Large Language Models Are Reasoning Teachers (ACL 2023)☆330Updated last month
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆213Updated 11 months ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆205Updated last year
- ☆97Updated last year
- Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on☆97Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆87Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆169Updated last year
- YuLan-IR: Information Retrieval Boosted LMs☆218Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆256Updated 7 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆124Updated 10 months ago
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆207Updated last year
- ☆143Updated 9 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆361Updated 7 months ago
- LLM Zoo collects information of various open- and close-sourced LLMs☆271Updated last year
- FireAct: Toward Language Agent Fine-tuning☆275Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆252Updated last year
- All available datasets for Instruction Tuning of Large Language Models☆248Updated last year
- ☆139Updated last year
- make LLM easier to use☆59Updated last year
- A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Huma…☆135Updated last year
- ☆128Updated last year
- deep learning☆149Updated last month
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆218Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆41Updated last year
- ☆278Updated last year
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆177Updated last year