mallorbc / Finetune_LLMs
Repo for fine-tuning Casual LLMs
☆449Updated 5 months ago
Related projects: ⓘ
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe…☆428Updated last year
- Tune any FALCON in 4-bit☆469Updated last year
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…☆347Updated last year
- ☆533Updated 9 months ago
- ☆453Updated 11 months ago
- ☆266Updated this week
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆810Updated last year
- User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.☆328Updated last year
- Ask Me Anything language model prompting☆536Updated last year
- Alpaca dataset from Stanford, cleaned and curated☆1,493Updated last year
- Code for fine-tuning Platypus fam LLMs using LoRA☆625Updated 7 months ago
- Customizable implementation of the self-instruct paper.☆1,004Updated 6 months ago
- Expanding natural instructions☆941Updated 9 months ago
- Crosslingual Generalization through Multitask Finetuning☆510Updated last year
- UI tool for fine-tuning and testing your own LoRA models base on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT…☆435Updated last year
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆858Updated 4 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers☆405Updated 8 months ago
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"☆1,049Updated 6 months ago
- SGPT: GPT Sentence Embeddings for Semantic Search☆838Updated 7 months ago
- A command-line interface to generate textual and conversational datasets with LLMs.☆291Updated last year
- A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research …☆867Updated 3 months ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.☆724Updated 3 weeks ago
- simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.☆382Updated last year
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆206Updated 3 months ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆957Updated 3 weeks ago
- Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.☆459Updated 6 months ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆301Updated last year
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆456Updated last year
- ☆123Updated last year
- ☆406Updated last year