git-cloner / llama-lora-fine-tuningLinks
llama fine-tuning with lora
โ139Updated last year
Alternatives and similar repositories for llama-lora-fine-tuning
Users that are interested in llama-lora-fine-tuning are comparing it to the libraries listed below
Sorting:
- llama2 finetuning with deepspeed and loraโ175Updated last year
- ๐ An unofficial implementation of Self-Alignment with Instruction Backtranslation.โ140Updated last month
- Naive Bayes-based Context Extensionโ326Updated 6 months ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDINGโ87Updated last year
- โ97Updated last year
- Large Language Models Are Reasoning Teachers (ACL 2023)โ339Updated 3 months ago
- โ323Updated last year
- Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so onโ97Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chatโ114Updated 2 years ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Modelsโ207Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructionsโ171Updated 2 years ago
- A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or lโฆโ282Updated last year
- All available datasets for Instruction Tuning of Large Language Modelsโ252Updated last year
- A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Humaโฆโ136Updated 2 years ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Modelsโ264Updated 9 months ago
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRAโ218Updated 2 years ago
- โ281Updated last year
- Paper List for In-context Learning ๐ทโ183Updated last year
- โ142Updated 11 months ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Humanโฆโ216Updated last year
- A large-scale, fine-grained, diverse preference dataset (and models).โ342Updated last year
- [NIPS2023] RRHF & Wombatโ808Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuningโ261Updated last year
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"โ207Updated last year
- โ138Updated last year
- ็ฎๅๆๆ็LLaMAๅพฎ่ฐๆๅใโ399Updated last year
- Generative Judge for Evaluating Alignmentโ239Updated last year
- โ459Updated last year
- YuLan-IR: Information Retrieval Boosted LMsโ222Updated last year
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.โ300Updated 2 years ago