dvlab-research / LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
☆2,654 Updated 8 months ago
Alternatives and similar repositories for LongLoRA:
Users who are interested in LongLoRA are comparing it to the libraries listed below:
- An Open-source Toolkit for LLM Development ☆2,776 Updated 3 months ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMs ☆1,431 Updated last year
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads ☆2,516 Updated 10 months ago
- Open Academic Research on Improving LLaMA to SOTA LLM ☆1,621 Updated last year
- 🩹Editing large language models within 10 seconds⚡ ☆1,327 Updated last year
- LOMO: LOw-Memory Optimization ☆985 Updated 10 months ago
- ⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡ ☆2,943 Updated last year
- Secrets of RLHF in Large Language Models Part I: PPO ☆1,360 Updated last year
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting ☆2,723 Updated 9 months ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters ☆5,868 Updated last year
- Aligning pretrained language models with instruction data generated by themselves. ☆4,359 Updated 2 years ago
- Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers". ☆2,099 Updated last year
- Instruction Tuning with GPT-4 ☆4,301 Updated last year
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm. ☆4,831 Updated 3 weeks ago
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,479 Updated last year
- [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration ☆2,984 Updated 3 weeks ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models ☆1,526 Updated last year
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin… ☆2,740 Updated last year
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24) ☆2,529 Updated 3 months ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters ☆1,821 Updated last year
- Mixture-of-Experts for Large Vision-Language Models ☆2,153 Updated 5 months ago
- Large-scale, Informative, and Diverse Multi-round Chat Data (and Models) ☆2,561 Updated last year
- Reference implementation for DPO (Direct Preference Optimization) ☆2,560 Updated 8 months ago
- [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs. ☆2,265 Updated this week
- Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration ☆1,563 Updated 4 months ago
- A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI ☆769 Updated last year
- AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation: ☆2,145 Updated this week
- 4 bits quantization of LLaMA using GPTQ ☆3,049 Updated 9 months ago
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection ☆1,549 Updated 6 months ago
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning. ☆5,016 Updated 5 months ago