gigio1023 / alpaca-lora-for-huggingfaceLinks

Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel

☆24

Alternatives and similar repositories for alpaca-lora-for-huggingface

Users that are interested in alpaca-lora-for-huggingface are comparing it to the libraries listed below

Sorting:

KyujinHan / Sakura-SOLAR-DPO
Sakura-SOLAR-DPO: Merge, SFT, and DPO
☆116Updated last year
seonghyeonye / TAPP
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
☆79Updated 10 months ago
thomasgauthier / LLM-self-play
Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)
☆29Updated last year
lcw99 / evolve-instruct
evolve llm training instruction, from english instruction to any language.
☆118Updated last year
seonghyeonye / Flipped-Learning
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆116Updated last month
deep-diver / LLM-Pref-Mark-UI
☆37Updated 2 years ago
kaistAI / Janus
[NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages
☆49Updated 8 months ago
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆77Updated 9 months ago
Digitous / LLM-SLERP-Merge
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆135Updated last year
soochan-lee / RoT
Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…
☆43Updated 2 years ago
zguo0525 / Dr.LLaMA
☆56Updated 2 years ago
SeungoneKim / CoTEVer
[EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification
☆41Updated 2 years ago
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆40Updated 8 months ago
Anni-Zou / Meta-CoT
Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
☆97Updated last year
jackaduma / Alpaca-LoRA-RLHF-PyTorch
A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…
☆59Updated 2 years ago
facebookresearch / Shepherd
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
☆219Updated 2 years ago
leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆104Updated 2 months ago
hills-code / open-instruct
☆17Updated last year
GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆78Updated last year
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆116Updated 2 years ago
LAION-AI / Anh
Anh - LAION's multilingual assistant datasets and models
☆27Updated 2 years ago
SALT-NLP / demonstrated-feedback
☆125Updated 10 months ago
FreedomIntelligence / MultilingualSIFT
MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning
☆94Updated last year
gangiswag / llm-reranker
☆50Updated 6 months ago
swj0419 / detect-pretrain-code-contamination
☆77Updated last year
kaistAI / KtrlF
[NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"
☆23Updated 9 months ago
gauss5930 / iDUS
An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.
☆12Updated last year
deep-diver / PingPong
manage histories of LLM applied applications
☆91Updated last year
qhjqhj00 / WebBrain
☆68Updated 2 years ago
bigcode-project / astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
☆59Updated last year