lxe / llama-tune
LLaMA Tuning with Stanford Alpaca Dataset using DeepSpeed and Transformers
☆51Updated last year
Related projects
Alternatives and complementary repositories for llama-tune
- ☆103Updated last year
- Unofficial implementation of AlpaGasus☆84Updated last year
- The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca☆96Updated last year
- ☆175Updated last year
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆111Updated last year
- ☆69Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆74Updated 10 months ago
- ☆94Updated last year
- Code for Scaling Laws of RoPE-based Extrapolation☆70Updated last year
- Spherical Merge PyTorch/HF format Language Models with minimal feature loss.☆112Updated last year
- MultilingualShareGPT, the free multi-language corpus for LLM training☆72Updated last year
- Source code and datasets for "How well do Large Language Models perform in Arithmetic tasks?"☆57Updated last year
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA☆80Updated 11 months ago
- An Experiment on Dynamic NTK Scaling RoPE☆61Updated 11 months ago
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding☆48Updated last year
- Code implementation of Baichuan's Dynamic NTK-ALiBi: inference on longer texts without fine-tuning☆46Updated last year
- ☆88Updated last month
- Measuring Massive Multitask Chinese Understanding☆87Updated 7 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆38Updated 8 months ago
- Open Source WizardCoder Dataset☆153Updated last year
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆74Updated last year
- A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to…☆49Updated last year
- Train LLaMA on a single A100 80G node using 🤗 transformers and 🚀 DeepSpeed Pipeline Parallelism☆207Updated last year
- [ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM☆181Updated last year
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆220Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆206Updated 10 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extreme Length (ICLR 2024)☆199Updated 6 months ago
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆56Updated last year
- An experimental implementation of the retrieval-enhanced language model☆75Updated last year
- Implementation of the paper "LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens"☆124Updated 4 months ago