git-cloner / llama-lora-fine-tuning
llama fine-tuning with lora
☆138Updated last year
Alternatives and similar repositories for llama-lora-fine-tuning
Users who are interested in llama-lora-fine-tuning are comparing it to the repositories listed below
- Large Language Models Are Reasoning Teachers (ACL 2023)☆341Updated 5 months ago
- llama2 finetuning with deepspeed and lora☆176Updated 2 years ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆210Updated last year
- Naive Bayes-based Context Extension☆326Updated 8 months ago
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆209Updated last year
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆139Updated 3 months ago
- An open-source chatbot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆301Updated 2 years ago
- YuLan-IR: Information Retrieval Boosted LMs☆222Updated last year
- A paper & resource list of large language models, including course, paper, demo, figures☆200Updated 2 years ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆219Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆114Updated 2 years ago
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.☆406Updated 8 months ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆88Updated last year
- ☆325Updated last year
- Large language model fine-tuning for bloom, opt, gpt, gpt2, llama, llama-2, cpmant, and so on☆98Updated last year
- Data and Code for Program of Thoughts [TMLR 2023]☆283Updated last year
- LLM Zoo collects information on various open- and closed-source LLMs☆271Updated 2 years ago
- Datasets for Instruction Tuning of Large Language Models☆255Updated last year
- Prod Env☆428Updated last year
- ☆96Updated last year
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA☆226Updated 2 weeks ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆177Updated last year
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them☆509Updated last year
- Generative Judge for Evaluating Alignment☆245Updated last year
- [NIPS2023] RRHF & Wombat☆812Updated last year
- Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Langu…☆353Updated 2 years ago
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆224Updated last year
- Paper List for a new paradigm of NLP: Interactive NLP (https://arxiv.org/abs/2305.13246)☆215Updated 2 years ago
- Paper collection on building and evaluating language model agents via executable language grounding☆361Updated last year
- Collaborative Training of Large Language Models in an Efficient Way☆416Updated last year