intelligent-machine-learning / dlrover
DLRover: An Automatic Distributed Deep Learning System
☆1,196Updated last week
Related projects: ⓘ
- FlagPerf is an open-source software platform for benchmarking AI chips.☆300Updated this week
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆512Updated last week
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆649Updated last week
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.☆1,045Updated last month
- GLake: optimizing GPU memory management and IO transmission.☆351Updated last month
- A PyTorch Native LLM Training Framework☆581Updated 3 weeks ago
- Best practice for training LLaMA models in Megatron-LM☆606Updated 8 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,833Updated 2 weeks ago
- Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]☆1,601Updated 10 months ago
- 📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batc…☆2,475Updated this week
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆260Updated last year
- Efficient Training (including pre-training and fine-tuning) for Big Models☆548Updated last month
- LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalabili…☆2,300Updated this week
- ☆251Updated last week
- Panda项目是于2023年5月启动的开源海外中文大语言模型项目,致力于大模型时代探索整个技术栈,旨在推动中文自然语言处理领域的创新和合作。☆1,063Updated 11 months ago
- 一种任务级GPU算力分时调度的高性能深度学习训练平台☆291Updated 10 months ago
- Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo☆1,014Updated last month
- The road to hack SysML and become an system expert☆424Updated 2 weeks ago
- FlagScale is a large model toolkit based on open-sourced projects.☆129Updated last week
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆796Updated 3 weeks ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆389Updated this week
- ☆284Updated 2 months ago
- 总结Prompt&LLM论文,开源数据&模型,AIGC应用☆2,574Updated this week
- An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)☆2,026Updated this week
- Industrial-first evaluation benchmark for LLMs in the DevOps/AIOps domain.☆671Updated 2 months ago
- 🎉 CUDA Learn Notes with PyTorch: fp32、fp16/bf16、fp8/int8、flash_attn、sgemm、sgemv、warp/block reduce、dot prod、elementwise、softmax、layernorm…☆1,158Updated this week
- LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案☆1,141Updated 9 months ago
- Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"☆972Updated 9 months ago
- 纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行☆3,285Updated this week
- FlashInfer: Kernel Library for LLM Serving☆1,143Updated last week