cansik / onnxruntime-silicon
ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64)
☆207 · Updated 8 months ago
Alternatives and similar repositories for onnxruntime-silicon:
Users interested in onnxruntime-silicon are comparing it to the libraries listed below:
- Deploy a Stable Diffusion model with ONNX/TensorRT + Triton server ☆123 · Updated last year
- Use safetensors with ONNX 🤗 ☆50 · Updated 3 weeks ago
- 🐍 | Python library for RunPod API and serverless worker SDK. ☆217 · Updated last week
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless workers provided by RunPod as endpoints. ☆57 · Updated last year
- ☆55 · Updated 2 years ago
- ONNX implementation of Whisper. PyTorch free. ☆92 · Updated 4 months ago
- Python bindings for ggml ☆140 · Updated 7 months ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others ☆46 · Updated last year
- FlashAttention (Metal port) ☆465 · Updated 6 months ago
- MLX implementations of various transformers, speedups, training ☆34 · Updated last year
- Demo Python script to interact with a llama.cpp server using the Whisper API, microphone, and webcam devices. ☆46 · Updated last year
- Attempt at a Cog wrapper using ComfyUI to run an SDXL txt2img workflow config ☆23 · Updated last year
- FRP fork ☆158 · Updated 3 weeks ago
- ONNX-powered inference for state-of-the-art face upscalers ☆93 · Updated 8 months ago
- ☆53 · Updated 2 years ago
- A Gradio component designed to continuously show any logs. ☆40 · Updated 3 months ago
- MLX image models for Apple Silicon machines ☆76 · Updated 4 months ago
- A curated list of amazing RunPod projects, libraries, and resources ☆108 · Updated 7 months ago
- Flux diffusion model implementation using quantized fp8 matmul; remaining layers use faster half-precision accumulate, which is ~2x fast… ☆256 · Updated 5 months ago
- A set of custom nodes for ComfyUI that allow you to use Core ML models in your ComfyUI workflows. ☆157 · Updated 7 months ago
- Examples of models deployable with Truss ☆166 · Updated this week
- Optimum version of a UI for Stable Diffusion, running on ONNX models for faster inference, working on most common GPU vendors: NVIDIA, AMD… ☆24 · Updated last year
- ☆84 · Updated last year
- Port of Suno's Bark TTS transformer in Apple's MLX framework ☆78 · Updated last year
- A simple library to speed up CLIP inference by up to 3x (K80 GPU) ☆215 · Updated last year
- Python tools for WhisperKit: model conversion, optimization, and evaluation ☆209 · Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper ☆25 · Updated 8 months ago
- ☆52 · Updated 2 years ago
- Running F5-TTS with ONNX Runtime ☆135 · Updated this week
- Docker image for Audiocraft audio processing and generation with deep learning ☆1 · Updated 9 months ago