gururise / AlpacaDataCleaned

Alpaca dataset from Stanford, cleaned and curated

☆1,560

Alternatives and similar repositories for AlpacaDataCleaned

Users who are interested in AlpacaDataCleaned are comparing it to the repositories listed below.

teknium1 / GPTeacher
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
☆1,637 Updated last year
young-geng / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆2,484 Updated 11 months ago
qwopqwop200 / GPTQ-for-LLaMa
4 bits quantization of LLaMA using GPTQ
☆3,059 Updated last year
sahil280114 / codealpaca
☆1,479 Updated 2 years ago
mbzuai-nlp / LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆820 Updated 2 years ago
yaodongC / awesome-instruction-dataset
A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)
☆1,125 Updated last year
yizhongw / self-instruct
Aligning pretrained language models with instruction data generated by themselves.
☆4,429 Updated 2 years ago
yuchenlin / LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…
☆952 Updated 9 months ago
togethercomputer / RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
☆4,777 Updated 7 months ago
rmihaylov / falcontune
Tune any FALCON in 4-bit
☆467 Updated last year
johnsmith0031 / alpaca_lora_4bit
☆535 Updated last year
jondurbin / airoboros
Customizable implementation of the self-instruct paper.
☆1,046 Updated last year
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
Instruction Tuning with GPT-4
☆4,314 Updated 2 years ago
CStanKonrad / long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…
☆1,459 Updated last year
FranxYao / chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
☆2,741 Updated 11 months ago
jquesnelle / yarn
YaRN: Efficient Context Window Extension of Large Language Models
☆1,521 Updated last year
deep-diver / LLM-As-Chatbot
LLM as a Chatbot Service
☆3,327 Updated last year
anthropics / hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
☆1,764 Updated last month
henrywoo / chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
☆1,203 Updated 6 months ago
IBM / Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
☆1,148 Updated 2 months ago
OpenLMLab / LOMO
LOMO: LOw-Memory Optimization
☆988 Updated last year
kuleshov-group / llmtools
Finetuning Large Language Models on One Consumer GPU in 2 Bits
☆727 Updated last year
xlang-ai / instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
☆1,988 Updated 6 months ago
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆628 Updated last year
randaller / llama-chat
Chat with Meta's LLaMA models at home made easy
☆837 Updated 2 years ago
AetherCortex / Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
☆1,619 Updated last year
booydar / recurrent-memory-transformer
[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.
☆766 Updated 9 months ago
abertsch72 / unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
☆1,062 Updated last year
melodysdreamj / WizardVicunaLM
LLM that combines the principles of wizardLM and vicunaLM
☆716 Updated 2 years ago
EleutherAI / pythia
The hub for EleutherAI's work on interpretability and learning dynamics
☆2,570 Updated last month