microsoft / WizardLM2Links
☆59Updated last year
Alternatives and similar repositories for WizardLM2
Users that are interested in WizardLM2 are comparing it to the libraries listed below
Sorting:
- Mixture-of-Experts (MoE) Language Model☆189Updated 10 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆345Updated last year
- ☆319Updated 9 months ago
- AI for all: Build the large graph of the language models☆270Updated last year
- ☆280Updated last month
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆235Updated 10 months ago
- Official repo for "Make Your LLM Fully Utilize the Context"☆252Updated last year
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆147Updated 11 months ago
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆502Updated 6 months ago
- [ACL 2024] Progressive LLaMA with Block Expansion.☆505Updated last year
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆316Updated last year
- ☆61Updated 11 months ago
- FuseAI Project☆87Updated 5 months ago
- ☆94Updated 7 months ago
- GLM Series Edge Models☆144Updated last month
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆212Updated last month
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆141Updated last year
- ☆157Updated last year
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆150Updated 3 months ago
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.☆148Updated 8 months ago
- Deep Reasoning Translation (DRT) Project☆226Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆136Updated 7 months ago
- ☆158Updated 10 months ago
- ☆121Updated last year
- Langchain implementation of HuggingGPT☆132Updated 2 years ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆229Updated 8 months ago
- Beating the GAIA benchmark with Transformers Agents. 🚀☆129Updated 4 months ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆139Updated last year
- The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Mem…☆370Updated last year
- Using Groq or OpenAI or Ollama to create o1-like reasoning chains☆297Updated 10 months ago