TianyiPeng / Colab_for_Alpaca_Lora

Here is a Google Colab Notebook for fine-tuning Alpaca Lora (within 3 hours with a 40GB A100 GPU)

☆38

Alternatives and similar repositories for Colab_for_Alpaca_Lora

Users who are interested in Colab_for_Alpaca_Lora are comparing it to the repositories listed below.

leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆103 Updated 6 months ago
iwalton3 / mpt-lora-patch
Patch for MPT-7B which allows using and training a LoRA
☆58 Updated 2 years ago
togethercomputer / Llama-2-7B-32K-Instruct
☆85 Updated 2 years ago
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆79 Updated last year
dsdanielpark / open-llm-leaderboard-report
Weekly visualization report of Open LLM model performance based on 4 metrics.
☆86 Updated last year
Gryphe / BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
☆208 Updated last year
emrgnt-cmplxty / zero-shot-replication
☆73 Updated 2 years ago
vihangd / alpaca-qlora
Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA
☆81 Updated last year
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆169 Updated last year
soochan-lee / RoT
Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…
☆45 Updated 2 years ago
RunzheYang / SocraticAI
Problem solving by engaging multiple AI agents in conversation with each other and the user.
☆230 Updated last year
kaistAI / SelFee
Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"
☆228 Updated 2 years ago
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆109 Updated 11 months ago
kyegomez / Kosmos-X
The Next Generation Multi-Modality Superintelligence
☆69 Updated last year
gigio1023 / alpaca-lora-for-huggingface
Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel
☆24 Updated 2 years ago
rmihaylov / mpttune
Tune MPTs
☆84 Updated 2 years ago
togethercomputer / OpenDataHub
☆128 Updated 2 years ago
toufunao / SCM4LLMs
☆32 Updated 2 years ago
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆72 Updated last year
stunningpixels / lou-eval
Track the progress of LLM context utilisation
☆55 Updated 7 months ago
hydrallm / llama-moe-v1
☆95 Updated 2 years ago
deep-diver / PingPong
manage histories of LLM applied applications
☆90 Updated 2 years ago
taprosoft / llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes…
☆146 Updated 2 years ago
thisserand / alpaca-lora-finetune-language
☆123 Updated 2 years ago
rlancemartin / generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
☆102 Updated 2 years ago
swj0419 / detect-pretrain-code-contamination
☆78 Updated last year
QuangBK / localLLM_guidance
Local LLM ReAct Agent with Guidance
☆158 Updated 2 years ago
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆118 Updated last year
catid / self-discover
Implementation of Google's SELF-DISCOVER
☆300 Updated last year
UranusSeven / llama_generative_agent
A generative agent implementation for LLaMA based models, derived from langchain's implementation.
☆178 Updated 2 years ago