lucadiliello / pytorch-apple-silicon-benchmarks
Performance of PyTorch on Apple Silicon
☆50Updated last year
Alternatives and similar repositories for pytorch-apple-silicon-benchmarks
Users that are interested in pytorch-apple-silicon-benchmarks are comparing it to the libraries listed below
Sorting:
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1☆136Updated 3 years ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆106Updated 4 months ago
- TensorFlow Metal Backend on Apple Silicon Experiments (just for fun)☆277Updated 3 years ago
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.☆180Updated last month
- ☆15Updated 3 years ago
- ONNX Command-Line Toolbox☆35Updated 7 months ago
- MLX support for the Open Neural Network Exchange (ONNX)☆49Updated last year
- Productionize machine learning predictions, with ONNX or without☆65Updated last year
- Python bindings for ggml☆140Updated 8 months ago
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆60Updated last month
- FlashAttention (Metal Port)☆483Updated 7 months ago
- New operators for the ReferenceEvaluator, new kernels for onnxruntime, CPU, CUDA☆32Updated 2 months ago
- ☆56Updated 2 years ago
- C API for MLX☆109Updated 3 weeks ago
- ☆18Updated 2 years ago
- Hacks for PyTorch☆19Updated 2 years ago
- example of using CoreML from c++☆24Updated last year
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆44Updated 11 months ago
- AdamW optimizer for bfloat16 models in pytorch 🔥.☆32Updated 11 months ago
- PyTorch's full-scratch build and install for Apple Silicon☆29Updated last year
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆111Updated last week
- A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-…☆67Updated last year
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆40Updated 2 months ago
- NVIDIA GPU tools - monitoring on CLI & web app with multiple agents☆87Updated last year
- TorchFix - a linter for PyTorch-using code with autofix support☆141Updated 3 months ago
- Tutorial on how to convert machine learned models into ONNX☆16Updated 2 years ago
- benchmarking some transformer deployments☆26Updated 2 years ago
- ☆345Updated 7 months ago
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…☆177Updated 5 months ago
- A sample pattern for running CI tests on Modal☆17Updated last month