cedrickchee / transformers-llama
LLaMA implementation for HuggingFace Transformers
☆38Updated 2 years ago
Alternatives and similar repositories for transformers-llama:
Users interested in transformers-llama are comparing it to the libraries listed below.
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆75Updated last year
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated 2 years ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆44Updated 5 months ago
- Self-Controlled Memory System for LLMs☆46Updated 11 months ago
- ☆33Updated last year
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆38Updated last week
- MultilingualShareGPT, the free multi-language corpus for LLM training☆73Updated last year
- Universal text classifier for generative models☆22Updated 8 months ago
- An Experiment on Dynamic NTK Scaling RoPE☆62Updated last year
- Fast LLM training codebase with dynamic strategy selection [DeepSpeed+Megatron+FlashAttention+CudaFusionKernel+Compiler]☆36Updated last year
- ☆53Updated 10 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆77Updated 11 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- The data processing pipeline for the Koala chatbot language model☆117Updated last year
- FuseAI Project☆84Updated 2 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆43Updated last year
- LLaMA tuning with the Stanford Alpaca dataset using DeepSpeed and Transformers☆51Updated 2 years ago
- kimi-chat test data☆7Updated last year
- ☆37Updated last year
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆81Updated last year
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆23Updated 5 months ago
- Codebase for LLM story generation; updated version of https://github.com/yangkevin2/doc-story-generation☆78Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- Tools for content datamining and NLP at scale☆42Updated 9 months ago
- ☆84Updated last year
- Code and models for BERT on STILTs☆53Updated 2 years ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 10 months ago
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …☆43Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year