cansik / onnxruntime-siliconLinks
ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64)
☆219Updated last year
Alternatives and similar repositories for onnxruntime-silicon
Users that are interested in onnxruntime-silicon are comparing it to the libraries listed below
Sorting:
- Python bindings for ggml☆145Updated 11 months ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆48Updated last year
- 🐍 | Python library for RunPod API and serverless worker SDK.☆246Updated last week
- stable-diffusion.cpp bindings for python☆58Updated last month
- FRP Fork☆175Updated 4 months ago
- ☆55Updated 2 years ago
- A Gradio component designed to continuously show any logs.☆49Updated 8 months ago
- LLaVA server (llama.cpp).☆181Updated last year
- Pybind11 bindings for Whisper.cpp☆337Updated 8 months ago
- Prebuilt Google MediaPipe packages for arm64.☆79Updated 2 years ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆224Updated 2 weeks ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- Extend the original llama.cpp repo to support redpajama model.☆118Updated 11 months ago
- TTS support with GGML☆146Updated this week
- Port of Suno AI's Bark in C/C++ for fast inference☆52Updated last year
- Deploy stable diffusion model with onnx/tenorrt + tritonserver☆125Updated 2 years ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆87Updated last year
- 🧰 | RunPod CLI for pod management☆323Updated last month
- Demonstration of MobileSAM in the browser enabled through ONNX runtime web☆111Updated 2 months ago
- Use safetensors with ONNX 🤗☆69Updated last month
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆220Updated 2 years ago
- CLIP inference in plain C/C++ with no extra dependencies☆514Updated 2 months ago
- FlashAttention (Metal Port)☆517Updated 10 months ago
- openvino version of openai/whisper☆171Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆188Updated last year
- ☆370Updated 10 months ago
- Python bindings for llama.cpp☆199Updated 2 years ago
- Python bindings for whisper.cpp☆243Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆247Updated last year
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆157Updated 3 months ago