git-cloner / llama-lora-fine-tuning
llama fine-tuning with lora
☆137Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for llama-lora-fine-tuning
- llama2 finetuning with deepspeed and lora☆167Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆498Updated 6 months ago
- Generative Judge for Evaluating Alignment☆217Updated 10 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆218Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆106Updated last year
- All available datasets for Instruction Tuning of Large Language Models☆237Updated 11 months ago
- YuLan-IR: Information Retrieval Boosted LMs☆215Updated 8 months ago
- Naive Bayes-based Context Extension☆313Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆87Updated 7 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆132Updated 4 months ago
- Official implementation of paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)☆287Updated 2 months ago
- Data and Code for Program of Thoughts (TMLR 2023)☆243Updated 6 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆307Updated 2 months ago
- Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on☆96Updated 6 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆109Updated 5 months ago
- ☆91Updated 11 months ago
- Official code for "Large Language Models Are Reasoning Teachers", ACL 2023☆306Updated last year
- FireAct: Toward Language Agent Fine-tuning☆255Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆164Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆219Updated 2 months ago
- ☆120Updated 7 months ago
- ☆129Updated 4 months ago
- A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Huma…☆126Updated last year
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.☆375Updated 2 weeks ago
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆201Updated last year
- Collection of training data management explorations for large language models☆286Updated 3 months ago
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA☆185Updated last year
- ☆133Updated last year
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆208Updated 6 months ago
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆139Updated last year