RUCAIBox / LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
☆10,346Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for LLMSurvey
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆16,354Updated this week
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.☆6,765Updated 3 months ago
- A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)☆9,457Updated 5 months ago
- Retrieval and Retrieval-augmented LLMs☆7,471Updated this week
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆4,062Updated this week
- Official release of InternLM2.5 base and chat models. 1M context support☆6,436Updated 3 weeks ago
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"☆10,661Updated 2 months ago
- Latest Advances on Multimodal Large Language Models☆12,542Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,028Updated 4 months ago
- An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.☆8,264Updated this week
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆12,630Updated last week
- Instruction Tuning with GPT-4☆4,199Updated last year
- Train transformer language models with reinforcement learning.☆9,967Updated this week
- Fast and memory-efficient exact attention☆14,109Updated this week
- 总结Prompt&LLM论文,开源数据&模型,AIGC应用☆2,671Updated this week
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆5,015Updated 3 months ago
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…☆2,612Updated 10 months ago
- Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)☆33,825Updated this week
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆13,946Updated last month
- 【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGP…☆1,785Updated 7 months ago
- An Open-Source Framework for Prompt-Learning.☆4,355Updated 3 months ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆5,834Updated 2 weeks ago
- Example models using DeepSpeed☆6,069Updated this week
- Aligning pretrained language models with instruction data generated by themselves.☆4,133Updated last year
- A series of large language models developed by Baichuan Intelligent Technology☆4,086Updated this week
- 本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)☆10,386Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆29,785Updated this week
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆4,593Updated this week
- ⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡☆2,934Updated 11 months ago
- ☆2,571Updated last week