cedrickchee / transformers-llama
LLaMA implementation for HuggingFace Transformers
☆38Updated last year
Related projects: ⓘ
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated last year
- Evaluation and analysis code for LLM360☆75Updated 3 months ago
- ☆73Updated 8 months ago
- ☆20Updated 6 months ago
- Evaluating tool-augmented LLMs in conversation settings☆72Updated 3 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆67Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM☆42Updated 4 months ago
- ☆71Updated last year
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆109Updated last year
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆32Updated last week
- Pre-training code for CrystalCoder 7B LLM☆52Updated 4 months ago
- ☆86Updated last year
- A streamlit app for visualizing LLM evals.☆38Updated 8 months ago
- ☆37Updated 9 months ago
- Data preparation code for Amber 7B LLM☆76Updated 4 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆68Updated last week
- Learning to Program with Natural Language☆5Updated 9 months ago
- ☆83Updated last year
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆22Updated last year
- ☆50Updated 3 months ago
- FuseAI Project☆75Updated 3 weeks ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆64Updated 2 months ago
- Small and Efficient Mathematical Reasoning LLMs☆69Updated 7 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆60Updated last year
- ☆47Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆72Updated 8 months ago
- The data processing pipeline for the Koala chatbot language model☆115Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web"☆106Updated last week
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆107Updated last year