git-cloner / llama-lora-fine-tuning
llama fine-tuning with lora
☆139 · Updated last year
Alternatives and similar repositories for llama-lora-fine-tuning
Users who are interested in llama-lora-fine-tuning are comparing it to the repositories listed below.
- llama2 finetuning with deepspeed and lora ☆174 · Updated last year
- An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆140 · Updated 3 weeks ago
- Naive Bayes-based Context Extension ☆326 · Updated 5 months ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat ☆115 · Updated 2 years ago
- Large Language Models Are Reasoning Teachers (ACL 2023) ☆336 · Updated 2 months ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models ☆206 · Updated last year
- ☆138 · Updated last year
- YuLan-IR: Information Retrieval Boosted LMs ☆221 · Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING ☆87 · Updated last year
- Generative Judge for Evaluating Alignment ☆238 · Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models ☆264 · Updated 8 months ago
- Large language model fine-tuning: bloom, opt, gpt, gpt2, llama, llama-2, cpmant, and so on ☆97 · Updated last year
- All available datasets for Instruction Tuning of Large Language Models ☆250 · Updated last year
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA ☆217 · Updated 2 years ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey` ☆178 · Updated 6 months ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks ☆178 · Updated last year
- FireAct: Toward Language Agent Fine-tuning ☆278 · Updated last year
- A large-scale, fine-grained, diverse preference dataset (and models). ☆340 · Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" ☆128 · Updated 11 months ago
- ☆97 · Updated last year
- Open Source WizardCoder Dataset ☆158 · Updated last year