git-cloner / llama-lora-fine-tuningLinks
llama fine-tuning with lora
☆140Updated last year
Alternatives and similar repositories for llama-lora-fine-tuning
Users that are interested in llama-lora-fine-tuning are comparing it to the libraries listed below
Sorting:
- Naive Bayes-based Context Extension☆326Updated last year
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆212Updated last year
- llama2 finetuning with deepspeed and lora☆175Updated 2 years ago
- Large Language Models Are Reasoning Teachers (ACL 2023)☆343Updated 9 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆138Updated 7 months ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆219Updated last year
- A paper & resource list of large language models, including course, paper, demo, figures☆200Updated 2 years ago
- ☆98Updated 2 years ago
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.☆409Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆89Updated last year
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆299Updated 2 years ago
- YuLan-IR: Information Retrieval Boosted LMs☆222Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆116Updated 2 years ago
- LLM Zoo collects information of various open- and close-sourced LLMs☆271Updated 2 years ago
- ☆333Updated last year
- Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on☆99Updated last year
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆103Updated 2 years ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆412Updated 6 months ago
- ☆129Updated 2 years ago
- ☆282Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆579Updated last year
- ☆164Updated 2 years ago
- Generative Judge for Evaluating Alignment☆248Updated last year
- Paper List for a new paradigm of NLP: Interactive NLP (https://arxiv.org/abs/2305.13246)☆217Updated 2 years ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Updated 2 years ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆136Updated last year
- Papers & Works for large languange models (OpenAI GPT-4, Meta Llama, etc.).☆317Updated last month
- A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Huma…☆140Updated 2 years ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆270Updated last year
- SOTA Math Opensource LLM☆332Updated 2 years ago