lxe / llama-tune
LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers
☆51Updated 2 years ago
Alternatives and similar repositories for llama-tune:
Users that are interested in llama-tune are comparing it to the libraries listed below
- Unofficial implementation of AlpaGasus☆90Updated last year
- ☆104Updated last year
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca☆97Updated 2 years ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- An Experiment on Dynamic NTK Scaling RoPE☆63Updated last year
- An experimental implementation of the retrieval-enhanced language model☆74Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆208Updated last year
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated 2 years ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆120Updated last year
- ☆67Updated last year
- ☆97Updated last year
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆216Updated last year
- MultilingualShareGPT, the free multi-language corpus for LLM training☆73Updated 2 years ago
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA☆81Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆169Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆41Updated last year
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆172Updated 2 years ago
- A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on…☆141Updated last year
- Open Source WizardCoder Dataset☆157Updated last year
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding☆49Updated last year
- Light local website for displaying performances from different chat models.☆86Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆65Updated 2 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆75Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆87Updated last year
- All available datasets for Instruction Tuning of Large Language Models☆248Updated last year
- ☆172Updated last year
- ☆98Updated 6 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆139Updated 5 months ago
- Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase…☆54Updated last year