lucadiliello / pytorch-apple-silicon-benchmarks
Performance of PyTorch on Apple Silicon
☆43Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for pytorch-apple-silicon-benchmarks
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1☆135Updated 2 years ago
- ☆15Updated 2 years ago
- PyTorch's full-scratch build and install for Apple Silicon☆30Updated 11 months ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆107Updated this week
- Tutorial on how to convert machine learned models into ONNX☆15Updated last year
- ☆47Updated 3 years ago
- A conda-smithy repository for nvcc.☆12Updated 2 weeks ago
- Model compression for ONNX☆75Updated this week
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆98Updated 2 months ago
- New operators for the ReferenceEvaluator, new kernels for onnxruntime, CPU, CUDA☆28Updated last month
- benchmarking some transformer deployments☆26Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆44Updated 2 weeks ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆43Updated last year
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…☆176Updated this week
- Benchmarking PyTorch 2.0 different models☆21Updated last year
- Perf monitoring CLI tool for Apple Silicon☆11Updated last year
- Productionize machine learning predictions, with ONNX or without☆66Updated 10 months ago
- The correct way to resize images or tensors. For Numpy or Pytorch (differentiable).☆16Updated 2 years ago
- ☆36Updated 2 years ago
- Conversions between kornia and other computer vision libraries formats☆34Updated last year
- ☆15Updated 3 years ago
- NVIDIA GPU tools - monitoring on CLI & web app with multiple agents☆83Updated 6 months ago
- Automatically insert nvtx ranges to PyTorch models☆17Updated 3 years ago
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆22Updated this week
- Customized matrix multiplication kernels☆53Updated 2 years ago
- ☆51Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year