open-compass / MixtralKitLinks

A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI

☆767

Alternatives and similar repositories for MixtralKit

Users that are interested in MixtralKit are comparing it to the libraries listed below

Sorting:

TencentARC / LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
☆507Updated last year
DachengLi1 / LongChat
Official repository for LongChat and LongEval
☆524Updated last year
OpenLMLab / LOMO
LOMO: LOw-Memory Optimization
☆989Updated last year
THUDM / AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
☆1,454Updated last year
pjlab-sys4nlp / llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
☆977Updated 8 months ago
ruixiangcui / AGIEval
☆758Updated last year
jquesnelle / yarn
YaRN: Efficient Context Window Extension of Large Language Models
☆1,553Updated last year
hpcaitech / SwiftInfer
Efficient AI Inference & Serving
☆472Updated last year
multimodal-art-projection / MAP-NEO
☆952Updated 6 months ago
princeton-nlp / LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
☆626Updated last year
GPT-Fathom / GPT-Fathom
GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well a…
☆350Updated last year
Xwin-LM / Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
☆1,041Updated last year
jzhang38 / EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
☆739Updated 10 months ago
XueFuzhao / OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
☆1,571Updated last year
microsoft / ToRA
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…
☆1,083Updated last year
hao-ai-lab / LookaheadDecoding
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
☆1,263Updated 5 months ago
yule-BUAA / MergeLM
Codebase for Merging Language Models (ICML 2024)
☆842Updated last year
magpie-align / magpie
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …
☆744Updated 4 months ago
FlagAI-Open / Aquila2
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
☆444Updated 9 months ago
datamllab / LongLM
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
☆660Updated last year
GanjinZero / RRHF
[NIPS2023] RRHF & Wombat
☆811Updated last year
THUDM / LongBench
LongBench v2 and LongBench (ACL 25'&24')
☆940Updated 6 months ago
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆628Updated last year
hiyouga / FastEdit
🩹Editing large language models within 10 seconds⚡
☆1,340Updated last year
IEIT-Yuan / Yuan-2.0
Yuan 2.0 Large Language Model
☆689Updated last year
dzhulgakov / llama-mistral
Inference code for Mistral and Mixtral hacked up into original Llama implementation
☆371Updated last year
InternLM / InternLM-techreport
☆905Updated 2 years ago
abacusai / Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…
☆591Updated last year
epfLLM / Megatron-LLM
distributed trainer for LLMs
☆578Updated last year
FranxYao / Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
☆467Updated last year