nod-ai / transformer-benchmarksLinks

benchmarking some transformer deployments

☆26

Alternatives and similar repositories for transformer-benchmarks

Users that are interested in transformer-benchmarks are comparing it to the libraries listed below

Sorting:

nod-ai / SRT
Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …
☆106Updated 6 months ago
nunoplopes / torchy
A tracing JIT compiler for PyTorch
☆13Updated 3 years ago
facebookresearch / FAMBench
Benchmarks to capture important workloads.
☆31Updated 6 months ago
octoml / octoml-profile
Home for OctoML PyTorch Profiler
☆113Updated 2 years ago
pytorch / torchdistx
Torch Distributed Experimental
☆117Updated last year
sdpython / onnxcustom
Tutorial on how to convert machine learned models into ONNX
☆16Updated 2 years ago
graphcore / tutorials
Training material for IPU users: tutorials, feature examples, simple applications
☆86Updated 2 years ago
NVIDIA / LDDL
Distributed preprocessing and data loading for language datasets
☆39Updated last year
octoml / Apple-M1-BERT
3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1
☆136Updated 3 years ago
graphcore / poptorch
PyTorch interface for the IPU
☆180Updated last year
onnx / steering-committee
Notes and artifacts from the ONNX steering committee
☆26Updated last week
deepspeedai / DeepSpeed-Kernels
☆74Updated 4 months ago
octoml / synr
A library for syntactically rewriting Python programs, pronounced (sinner).
☆69Updated 3 years ago
facebookresearch / fairring
Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …
☆65Updated 3 years ago
pytorch / rfcs
PyTorch RFCs (experimental)
☆133Updated 2 months ago
hpcaitech / TensorNVMe
A Python library transfers PyTorch tensors between CPU and NVMe
☆117Updated 8 months ago
Jokeren / triton-samples
☆28Updated 6 months ago
UmerHA / triton_util
Make triton easier
☆47Updated last year
triton-inference-server / pytorch_backend
The Triton backend for the PyTorch TorchScript models.
☆157Updated 2 weeks ago
spcl / substation
Research and development for optimizing transformers
☆129Updated 4 years ago
Harry-Chen / InfMoE
Inference framework for MoE layers based on TensorRT with Python binding
☆41Updated 4 years ago
pytorch / torchsnapshot
A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…
☆158Updated last month
lianakoleva / no-libtorch-compile
☆21Updated 5 months ago
Michaelvll / llm-ie-benchmarks
A collection of reproducible inference engine benchmarks
☆32Updated 3 months ago
pytorch / multipy
torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…
☆180Updated 3 weeks ago
parasj / checkmate
Training neural networks in TensorFlow 2.0 with 5x less memory
☆132Updated 3 years ago
fidelity / stoke
A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-…
☆67Updated 2 years ago
CentML / DeepView.Profile
🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
☆64Updated 6 months ago
graphcore / examples
Example code and applications for machine learning on Graphcore IPUs
☆323Updated last year
intel / torch-ccl
oneCCL Bindings for Pytorch*
☆99Updated 3 weeks ago