InternLM / InternEvoLinks

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

☆411

Alternatives and similar repositories for InternEvo

Users that are interested in InternEvo are comparing it to the libraries listed below

Sorting:

flagos-ai / FlagScale
FlagScale is a large model toolkit based on open-sourced projects.
☆407Updated last week
alibaba / ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
☆437Updated 3 weeks ago
feifeibear / long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
☆598Updated last month
modelscope / dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …
☆268Updated 3 months ago
DeepLink-org / dlinfer
☆65Updated last week
inferflow / inferflow
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
☆249Updated last year
FMInference / H2O
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
☆487Updated last year
alibaba / Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
☆659Updated last year
stepfun-ai / Step3
☆435Updated 3 months ago
alipay / PainlessInferenceAcceleration
Accelerate inference without tears
☆367Updated last month
madsys-dev / deepseekv2-profile
☆151Updated 8 months ago
InternLM / Agent-FLAN
[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
☆355Updated last year
sgl-project / SpecForge
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
☆483Updated this week
openpsi-project / ReaLHF
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
☆323Updated 6 months ago
hemingkx / Spec-Bench
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
☆334Updated 7 months ago
feifeibear / LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
☆853Updated last year
zhuzilin / ring-flash-attention
Ring attention implementation with flash attention
☆910Updated 2 months ago
hahnyuan / LLM-Viewer
Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline mod…
☆578Updated last year
qingkelab / qingketalk
青稞Talk
☆161Updated last week
Strivin0311 / long-llms-learning
A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks
☆269Updated last year
step-law / steplaw
☆205Updated 3 weeks ago
mit-han-lab / Quest
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
☆353Updated 4 months ago
thunlp / InfLLM
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Mem…
☆388Updated last year
Tencent / KsanaLLM
☆512Updated 2 months ago
ModelTC / LightCompress
[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLM, VLM, and video generation models.
☆622Updated last week
OpenBMB / InfiniteBench
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
☆355Updated last year
NVIDIA-NeMo / Megatron-Bridge
Training library for Megatron-based models
☆193Updated this week
ISEEKYAN / mbridge
Bridge Megatron-Core to Hugging Face/Reinforcement Learning
☆159Updated last week
infinigence / LVEval
Repository of LV-Eval Benchmark
☆71Updated last year
THUDM / LongBench
LongBench v2 and LongBench (ACL 25'&24')
☆1,020Updated 10 months ago