microsoft / WizardLM2
☆59 Updated 11 months ago
Alternatives and similar repositories for WizardLM2:
Users who are interested in WizardLM2 are comparing it to the repositories listed below:
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper ☆129 Updated 8 months ago
- It's an open-source LLM based on an MoE structure. ☆58 Updated 8 months ago
- Mixture-of-Experts (MoE) Language Model ☆185 Updated 6 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP ☆67 Updated 8 months ago
- GLM Series Edge Models ☆131 Updated last month
- ☆312 Updated 6 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆130 Updated 9 months ago
- AI for all: Build the large graph of the language models ☆263 Updated 9 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation" ☆133 Updated last month
- Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder ☆45 Updated 11 months ago
- FuseAI Project ☆84 Updated 2 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents ☆198 Updated last month
- Imitate OpenAI with Local Models ☆88 Updated 7 months ago
- SUS-Chat: Instruction tuning done right ☆48 Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆132 Updated 3 months ago
- 👑 Qwen Blog. ☆52 Updated this week
- ☆216 Updated 11 months ago
- ☆74 Updated last year
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆338 Updated 9 months ago
- ☆182 Updated last month
- [ACL 2024] Progressive LLaMA with Block Expansion. ☆499 Updated 10 months ago
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities. ☆99 Updated this week
- ☆44 Updated 3 months ago
- ☆94 Updated 3 months ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc. ☆139 Updated 11 months ago
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought ☆211 Updated 3 months ago
- TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios ☆185 Updated 6 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities. ☆37 Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆131 Updated 9 months ago
- ☆29 Updated 7 months ago