dvlab-research / LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
☆2,593 Updated last month
Related projects:
- An Open-source Toolkit for LLM Development ☆2,684 Updated 3 months ago
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads ☆2,206 Updated 2 months ago
- ⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡ ☆2,922 Updated 9 months ago
- Large-scale, Informative, and Diverse Multi-round Chat Data (and Models) ☆2,210 Updated 6 months ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24) ☆2,115 Updated 3 weeks ago
- Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration ☆1,521 Updated 3 months ago
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning. ☆4,742 Updated last week
- AgentTuning: Enabling Generalized Agent Abilities for LLMs ☆1,329 Updated 10 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs. ☆4,184 Updated this week
- MOSS-RLHF ☆1,267 Updated 6 months ago
- An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral) ☆2,026 Updated this week
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting ☆2,513 Updated last month
- LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalabili… ☆2,292 Updated this week
- An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm. ☆4,326 Updated last month
- [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration ☆2,333 Updated 2 months ago
- InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output ☆2,449 Updated 2 weeks ago
- AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation: ☆1,624 Updated this week
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, … ☆3,741 Updated this week
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models ☆1,352 Updated 6 months ago
- LOMO: LOw-Memory Optimization ☆974 Updated 2 months ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters ☆5,691 Updated 6 months ago
- Mixture-of-Experts for Large Vision-Language Models ☆1,911 Updated 4 months ago
- Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model ☆3,200 Updated 7 months ago
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast. ☆1,436 Updated this week
- Emu Series: Generative Multimodal Models from BAAI ☆1,604 Updated 6 months ago
- An open-source framework for training large multimodal models. ☆3,658 Updated 2 weeks ago
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...) ☆3,730 Updated 3 weeks ago
- Tools for merging pretrained large language models. ☆4,501 Updated this week
- Instruction Tuning with GPT-4 ☆4,165 Updated last year
- 🩹Editing large language models within 10 seconds⚡ ☆1,268 Updated last year