cedrickchee / transformers-llama
LLaMA implementation for HuggingFace Transformers
☆38Updated last year
Alternatives and similar repositories for transformers-llama:
Users who are interested in transformers-llama are comparing it to the repositories listed below.
- Pre-training code for CrystalCoder 7B LLM☆55Updated 9 months ago
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆50Updated last year
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆37Updated 3 weeks ago
- ☆74Updated last year
- Tools for content datamining and NLP at scale☆42Updated 8 months ago
- ☆36Updated last year
- Here is a Google Colab Notebook for fine-tuning Alpaca Lora (within 3 hours with a 40GB A100 GPU)☆38Updated last year
- Data preparation code for Amber 7B LLM☆85Updated 9 months ago
- ☆37Updated last year
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 3 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 9 months ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆99Updated 7 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models☆22Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆67Updated 4 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- The data processing pipeline for the Koala chatbot language model☆117Updated last year
- Evaluating tool-augmented LLMs in conversation settings☆77Updated 8 months ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…☆27Updated last year
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆97Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆74Updated 5 months ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆111Updated last year
- FuseAI Project☆83Updated 3 weeks ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 4 months ago
- Adversarial Training and SFT for Bot Safety Models☆39Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- A Python implementation of Toolformer using Huggingface Transformers☆15Updated last year
- Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder☆45Updated 10 months ago
- ☆74Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆44Updated 4 months ago