git-cloner / llama-lora-fine-tuningLinks
llama fine-tuning with lora
☆140Updated last year
Alternatives and similar repositories for llama-lora-fine-tuning
Users that are interested in llama-lora-fine-tuning are comparing it to the libraries listed below
Sorting:
- llama2 finetuning with deepspeed and lora☆175Updated 2 years ago
- Large Language Models Are Reasoning Teachers (ACL 2023)☆342Updated 8 months ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆212Updated last year
- Naive Bayes-based Context Extension☆325Updated 11 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆138Updated 6 months ago
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆209Updated 2 years ago
- Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on☆99Updated last year
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆219Updated last year
- YuLan-IR: Information Retrieval Boosted LMs☆221Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆89Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆177Updated 2 years ago
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆299Updated 2 years ago
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.☆407Updated 10 months ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Updated 2 years ago
- A paper & resource list of large language models, including course, paper, demo, figures☆200Updated 2 years ago
- ☆330Updated last year
- LLM Zoo collects information of various open- and close-sourced LLMs☆271Updated 2 years ago
- Datasets for Instruction Tuning of Large Language Models☆258Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆115Updated 2 years ago
- Generative Judge for Evaluating Alignment☆247Updated last year
- ☆98Updated last year
- ☆123Updated last year
- ☆147Updated last year
- https://acl2023-retrieval-lm.github.io/☆158Updated 2 years ago
- ☆142Updated 2 years ago
- Must-read papers, related blogs and API tools on the pre-training and tuning methods for ChatGPT.☆323Updated 2 years ago
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA☆233Updated 3 months ago
- Data and Code for Program of Thoughts [TMLR 2023]☆292Updated last year
- ☆128Updated 2 years ago
- Prod Env☆434Updated 2 years ago