second-state / meetups
☆69Updated last month
Alternatives and similar repositories for meetups:
Users who are interested in meetups are comparing it to the repositories listed below
- ☆58Updated 4 years ago
- Compare different hardware platforms via the Roofline Model for LLM inference tasks.☆93Updated 11 months ago
- ☆127Updated last month
- ☆22Updated last year
- FlagScale is a large model toolkit based on open-sourced projects.☆223Updated this week
- ☆23Updated last year
- Transformer related optimization, including BERT, GPT☆39Updated 2 years ago
- ☆76Updated last year
- export llama to onnx☆112Updated last month
- PyTorch distributed training acceleration framework☆39Updated last week
- run ChatGLM2-6B in BM1684X☆49Updated 11 months ago
- PaddlePaddle custom device implementation. (PaddlePaddle custom hardware integration implementation)☆80Updated this week
- Models and examples built with OneFlow☆96Updated 4 months ago
- ☆52Updated last year
- AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…☆222Updated last week
- ☆67Updated 2 months ago
- ☆140Updated 9 months ago
- Deploy ChatGLM on Modelz☆15Updated last year
- LLM Inference benchmark☆392Updated 6 months ago
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆238Updated 11 months ago
- ☆57Updated 3 months ago
- OneFlow->ONNX☆42Updated last year
- Fast and memory-efficient exact attention☆44Updated this week
- PaddlePaddle Developer Community☆97Updated this week
- ☆43Updated this week
- ☆33Updated last year
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆35Updated 3 months ago
- ☆314Updated last month