lxe / llama-tuneLinks
LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers
☆50Updated 2 years ago
Alternatives and similar repositories for llama-tune
Users that are interested in llama-tune are comparing it to the libraries listed below
Sorting:
- ☆98Updated 2 years ago
- The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca☆98Updated 2 years ago
- Open Source WizardCoder Dataset☆163Updated 2 years ago
- ☆105Updated 2 years ago
- ☆180Updated 2 years ago
- Unofficial implementation of AlpaGasus☆94Updated 2 years ago
- [ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM☆190Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆209Updated 2 years ago
- A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on…☆142Updated 2 years ago
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆224Updated 2 years ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Updated 2 years ago
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆178Updated 2 years ago
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools☆144Updated 2 years ago
- Datasets for Instruction Tuning of Large Language Models☆260Updated 2 years ago
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆26Updated 2 years ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆137Updated 8 months ago
- ☆123Updated 2 years ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆89Updated last year
- YuLan-IR: Information Retrieval Boosted LMs☆220Updated last year
- Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?☆57Updated 2 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆75Updated 2 years ago
- An experimental implementation of the retrieval-enhanced language model☆75Updated 3 years ago
- ☆173Updated 2 years ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆67Updated 2 years ago
- MultilingualShareGPT, the free multi-language corpus for LLM training☆73Updated 2 years ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆102Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples☆216Updated last year
- ⏳ ChatLog: Recording and Analysing ChatGPT Across Time☆103Updated last year
- Light local website for displaying performances from different chat models.☆87Updated 2 years ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated 2 years ago