cansik / onnxruntime-silicon
ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64)
☆197 Updated 4 months ago
Related projects
Alternatives and complementary repositories for onnxruntime-silicon
- Python bindings for ggml ☆132 Updated 2 months ago
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless workers provided by RunPod as endpoints. ☆55 Updated last year
- Demo Python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆45 Updated last year
- 🐍 | Python library for RunPod API and serverless worker SDK. ☆184 Updated this week
- ☆51 Updated last year
- A ggml (C++) re-implementation of tortoise-tts ☆160 Updated 3 months ago
- A curated list of amazing RunPod projects, libraries, and resources ☆102 Updated 3 months ago
- A set of custom nodes for ComfyUI that allow you to use Core ML models in your ComfyUI workflows. ☆131 Updated 3 months ago
- ☆52 Updated last year
- 🚀 | A simple worker that can be used as a starting point to build your own custom RunPod Endpoint API worker. ☆78 Updated 3 weeks ago
- Deploy stable diffusion model with onnx/tensorrt + tritonserver ☆123 Updated last year
- whisper.cpp bindings for Python ☆77 Updated last year
- Extend the original llama.cpp repo to support redpajama model. ☆117 Updated 2 months ago
- Example of using CoreML from C++ ☆22 Updated last year
- LLaVA server (llama.cpp). ☆177 Updated last year
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…