git-cloner / llama-lora-fine-tuning
llama fine-tuning with lora
☆139 · Updated last year
Alternatives and similar repositories for llama-lora-fine-tuning
Users who are interested in llama-lora-fine-tuning are comparing it to the libraries listed below.
- llama2 finetuning with deepspeed and lora ☆174 · Updated last year
- An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆139 · Updated last week
- Naive Bayes-based Context Extension ☆326 · Updated 5 months ago
- Large Language Models Are Reasoning Teachers (ACL 2023) ☆334 · Updated 2 months ago
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models" ☆206 · Updated last year
- All available datasets for Instruction Tuning of Large Language Models ☆250 · Updated last year
- ☆97 · Updated last year
- Large language model fine-tuning for BLOOM, OPT, GPT, GPT-2, LLaMA, LLaMA-2, CPM-Ant, and so on ☆97 · Updated last year
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models ☆206 · Updated last year
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human… ☆213 · Updated 11 months ago
- LLM Zoo collects information on various open- and closed-source LLMs ☆271 · Updated last year
- [NIPS2023] RRHF & Wombat ☆807 · Updated last year
- Official implementation of paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371) ☆293 · Updated 7 months ago
- Generative Judge for Evaluating Alignment ☆237 · Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models ☆261 · Updated 8 months ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING ☆87 · Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat ☆115 · Updated last year
- YuLan-IR: Information Retrieval Boosted LMs ☆219 · Updated last year
- ☆143 · Updated 10 months ago
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability. ☆300 · Updated last year
- Source code for the paper "Active Prompting with Chain-of-Thought for Large Language Models" ☆237 · Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions ☆171 · Updated last year
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks ☆177 · Updated last year
- Code for "Small Models are Valuable Plug-ins for Large Language Models" ☆129 · Updated last year
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them ☆488 · Updated 10 months ago
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning. ☆130 · Updated last year
- A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Huma… ☆135 · Updated 2 years ago
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA ☆213 · Updated last year
- Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Langu… ☆347 · Updated last year
- Prod Env ☆417 · Updated last year