cansik / onnxruntime-silicon
ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64)
☆201Updated 6 months ago
Alternatives and similar repositories for onnxruntime-silicon:
Users that are interested in onnxruntime-silicon are comparing it to the libraries listed below
- Optimum version of a UI for Stable Diffusion, running on ONNX models for faster inference, working on most common GPU vendors: NVIDIA,AMD…☆22Updated last year
- Python bindings for ggml☆136Updated 4 months ago
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.☆57Updated last year
- The community repository for the Draw Things app.☆141Updated this week
- Deploy stable diffusion model with onnx/tenorrt + tritonserver☆122Updated last year
- Prebuilt Google MediaPipe packages for arm64.☆77Updated last year
- A Gradio component designed to continuously show any logs.☆35Updated last month
- 🐍 | Python library for RunPod API and serverless worker SDK.☆205Updated 2 weeks ago
- ☆23Updated 10 months ago
- Demonstration of MobileSAM in the browser enabled through ONNX runtime web☆97Updated last year
- FRP Fork☆145Updated last month
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆196Updated last week
- Examples of models deployable with Truss☆152Updated this week
- openvino version of openai/whisper☆164Updated last year
- ☆54Updated last year
- ☆84Updated last year
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆211Updated last year
- ONNX-Powered Inference for State-of-the-Art Face Upscalers☆84Updated 6 months ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- stable-diffusion.cpp bindings for python☆34Updated last week
- A curated list of amazing RunPod projects, libraries, and resources☆104Updated 5 months ago
- A set of custom nodes for ComfyUI that allow you to use Core ML models in your ComfyUI workflows.☆144Updated 5 months ago
- ☆315Updated 3 months ago
- A ggml (C++) re-implementation of tortoise-tts☆175Updated 5 months ago
- The Triton backend for TensorRT.☆68Updated 2 weeks ago
- FlashAttention (Metal Port)☆430Updated 4 months ago
- Python bindings for whisper.cpp☆223Updated 7 months ago
- Fork of llama.cpp, extended for GPT-NeoX, RWKV-v4, and Falcon models☆30Updated last year
- ☆52Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆220Updated last month