git-cloner / llama-lora-fine-tuning
llama fine-tuning with lora
☆140Updated 10 months ago
Alternatives and similar repositories for llama-lora-fine-tuning:
Users that are interested in llama-lora-fine-tuning are comparing it to the libraries listed below
- llama2 finetuning with deepspeed and lora☆174Updated last year
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆137Updated 9 months ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆204Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆114Updated last year
- Naive Bayes-based Context Extension☆322Updated 3 months ago
- Large Language Models Are Reasoning Teachers (ACL 2023)☆327Updated 2 weeks ago
- ☆95Updated last year
- A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Huma…☆134Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆87Updated last year
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Updated last year
- ☆134Updated 11 months ago
- All available datasets for Instruction Tuning of Large Language Models☆247Updated last year
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA☆206Updated last year
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆206Updated last year
- Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks☆90Updated last year
- 怎么训练一个LLM分词器☆142Updated last year
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆347Updated 6 months ago
- Generative Judge for Evaluating Alignment☆230Updated last year
- ☆160Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆246Updated last year
- Official implementation of paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)☆291Updated 6 months ago
- 大语言模型指令调优工具(支持 FlashAttention)☆171Updated last year
- FireAct: Toward Language Agent Fine-tuning☆271Updated last year
- ☆459Updated 9 months ago
- ☆128Updated last year
- ☆142Updated 8 months ago
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆168Updated last year
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.☆398Updated 3 months ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆212Updated 10 months ago
- Data and Code for Program of Thoughts (TMLR 2023)☆264Updated 10 months ago