cedrickchee / transformers-llama
LLaMA implementation for HuggingFace Transformers
☆38Updated 2 years ago
Alternatives and similar repositories for transformers-llama:
Users that are interested in transformers-llama are comparing it to the libraries listed below
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆23Updated 6 months ago
- ☆75Updated last year
- Adversarial Training and SFT for Bot Safety Models☆39Updated 2 years ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated 6 months ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated 2 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- ☆37Updated last year
- ☆53Updated 10 months ago
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆51Updated 2 years ago
- Pre-training code for CrystalCoder 7B LLM☆54Updated 11 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- ☆37Updated 2 years ago
- Multi-Domain Expert Learning☆67Updated last year
- ☆17Updated last year
- ☆84Updated last year
- Data preparation code for Amber 7B LLM☆88Updated 11 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆59Updated 8 months ago
- FuseAI Project☆85Updated 3 months ago
- ☆19Updated 3 months ago
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆58Updated last year
- ☆73Updated last year
- The data processing pipeline for the Koala chatbot language model☆117Updated 2 years ago
- ☆27Updated last month
- Advanced Reasoning Benchmark Dataset for LLMs☆45Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- ☆51Updated 9 months ago