IEIT-Yuan / Yuan2.0-M32
Mixture-of-Experts (MoE) Language Model
☆179Updated last week
Related projects: ⓘ
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆121Updated 3 months ago
- ☆196Updated 4 months ago
- 🐳 Aurora is a [Chinese Version] MoE model. Aurora is a further work based on Mixtral-8x7B, which activates the chat capability of the mo …☆257Updated 4 months ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆128Updated 5 months ago
- ☆268Updated last month
- ☆180Updated 4 months ago
- Imitate OpenAI with Local Models☆83Updated 3 weeks ago
- LongQLoRA: Extent Context Length of LLMs Efficiently☆156Updated 10 months ago
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆232Updated 6 months ago
- SOTA Math Opensource LLM☆296Updated 9 months ago
- LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation☆194Updated 4 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆120Updated 9 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆208Updated last month
- ☆286Updated 2 months ago
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.☆429Updated 7 months ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆119Updated last month
- Efficient AI Inference & Serving☆452Updated 8 months ago
- ☆105Updated this week
- Train a Chinese LLM From 0 by Personal☆145Updated last week
- 大模型多维度中文对齐评测基准 (ACL 2024)☆292Updated last month
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆244Updated last week
- SUS-Chat: Instruction tuning done right☆47Updated 8 months ago
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…☆276Updated this week
- Naive Bayes-based Context Extension☆310Updated last year
- ☆104Updated 11 months ago
- An automated pipeline for evaluating LLMs for role-playing.☆118Updated this week
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆227Updated this week
- ☆29Updated 2 weeks ago
- ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases☆281Updated 2 months ago
- Official Pytorch Implementation for MathGLM☆315Updated 9 months ago