dvlab-research / LongLoRALinks

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

☆2,674

Alternatives and similar repositories for LongLoRA

Users that are interested in LongLoRA are comparing it to the libraries listed below

Sorting:

FasterDecoding / Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
☆2,583Updated last year
Alpha-VLLM / LLaMA2-Accessory
An Open-source Toolkit for LLM Development
☆2,786Updated 6 months ago
FranxYao / chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
☆2,742Updated 11 months ago
OpenLMLab / LOMO
LOMO: LOw-Memory Optimization
☆988Updated last year
THUDM / AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
☆1,454Updated last year
OpenLMLab / MOSS-RLHF
Secrets of RLHF in Large Language Models Part I: PPO
☆1,384Updated last year
FreedomIntelligence / LLMZoo
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
☆2,945Updated last year
AetherCortex / Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
☆1,619Updated last year
Xwin-LM / Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
☆1,040Updated last year
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
Instruction Tuning with GPT-4
☆4,319Updated 2 years ago
hiyouga / FastEdit
🩹Editing large language models within 10 seconds⚡
☆1,339Updated last year
PhoebusSi / Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…
☆2,761Updated last year
AGI-Edgerunners / LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
☆1,185Updated last year
lyuchenyang / Macaw-LLM
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
☆1,578Updated 7 months ago
jquesnelle / yarn
YaRN: Efficient Context Window Extension of Large Language Models
☆1,536Updated last year
yizhongw / self-instruct
Aligning pretrained language models with instruction data generated by themselves.
☆4,437Updated 2 years ago
tatsu-lab / alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
☆1,816Updated 7 months ago
THUDM / AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
☆2,704Updated 6 months ago
XueFuzhao / OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
☆1,568Updated last year
open-compass / MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
☆767Updated last year
AutoGPTQ / AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
☆4,905Updated 3 months ago
MLGroupJLU / LLM-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
☆1,548Updated last month
microsoft / ToRA
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…
☆1,083Updated last year
openai / prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
☆2,032Updated 2 years ago
X-PLUG / mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
☆2,502Updated 3 months ago
CStanKonrad / long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…
☆1,460Updated last year
OpenBMB / ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
☆5,182Updated 2 months ago
WeOpenML / PandaLM
☆920Updated last year
mit-han-lab / llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
☆3,181Updated 2 weeks ago
lucidrains / self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
☆1,394Updated last year