microsoft / WizardLM2
☆60Updated 10 months ago
Alternatives and similar repositories for WizardLM2:
Users who are interested in WizardLM2 are comparing it to the repositories listed below.
- Mixture-of-Experts (MoE) Language Model☆184Updated 5 months ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆125Updated 7 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆60Updated 7 months ago
- Recipes for creating AI Applications with APIs from DashScope (and friends)!☆42Updated 4 months ago
- Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder☆45Updated 10 months ago
- ☆91Updated 2 months ago
- The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Mem…☆326Updated 10 months ago
- FuseAI Project☆83Updated 3 weeks ago
- AI for all: Build the large graph of the language models☆254Updated 8 months ago
- GLM Series Edge Models☆130Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆129Updated 2 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆241Updated 2 months ago
- An open-source LLM based on an MoE structure.☆58Updated 7 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆128Updated 8 months ago
- Imitate OpenAI with Local Models☆86Updated 5 months ago
- ☆304Updated 5 months ago
- ☆51Updated 6 months ago
- Deploy a lightweight, full OpenAI API for production with vLLM, supporting /v1/embeddings with all embedding models.☆40Updated 7 months ago
- ☆168Updated 2 months ago
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆98Updated 2 months ago
- ☆139Updated 7 months ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆135Updated 10 months ago
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆465Updated last month
- SUS-Chat: Instruction tuning done right☆48Updated last year
- We are the first fully commercially usable role-playing large model.☆39Updated 6 months ago
- ☆209Updated 9 months ago
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆217Updated this week
- ☆152Updated 7 months ago
- ☆125Updated 3 weeks ago
- connecting humans and agents☆69Updated 2 months ago