cansik / onnxruntime-silicon
ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64)
☆232 · Updated last year
Alternatives and similar repositories for onnxruntime-silicon
Users who are interested in onnxruntime-silicon are comparing it to the libraries listed below.
- Inference of Large Multimodal Models in C/C++. LLaVA and others ☆48 · Updated 2 years ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation ☆236 · Updated 3 months ago
- Python bindings for ggml ☆147 · Updated last year
- ☆58 · Updated 2 years ago
- Use safetensors with ONNX 🤗 ☆87 · Updated this week
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp ☆170 · Updated 9 months ago
- LLaVA server (llama.cpp). ☆183 · Updated 2 years ago
- 🐍 | Python library for RunPod API and serverless worker SDK. ☆270 · Updated this week
- Prebuilt Google MediaPipe packages for arm64. ☆81 · Updated 2 years ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆46 · Updated 2 years ago
- FRP Fork ☆177 · Updated 10 months ago
- Pybind11 bindings for Whisper.cpp ☆344 · Updated last year
- Port of Suno's Bark TTS transformer in Apple's MLX Framework ☆86 · Updated 2 years ago
- stable-diffusion.cpp bindings for python ☆98 · Updated this week
- Port of Suno AI's Bark in C/C++ for fast inference ☆54 · Updated last year
- A ggml (C++) re-implementation of tortoise-tts ☆193 · Updated last year
- Port of Meta's Encodec in C/C++ ☆227 · Updated last year
- On-device Image Generation for Apple Silicon ☆687 · Updated 10 months ago
- openvino version of openai/whisper ☆182 · Updated 2 years ago
- mlx image models for Apple Silicon machines ☆91 · Updated 2 months ago
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon ☆273 · Updated 3 months ago
- 🧰 | Runpod CLI for pod management ☆364 · Updated this week
- FlashAttention (Metal Port) ☆579 · Updated last year
- Deploy stable diffusion model with onnx/tensorrt + tritonserver ☆126 · Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆121 · Updated 2 years ago
- A PoC to run Segment Anything Model (SAM) entirely in the browser without any backend ☆79 · Updated 2 years ago
- Local ML voice chat using high-end models. ☆183 · Updated last month
- Port of Microsoft's BioGPT in C/C++ using ggml ☆86 · Updated last year
- Examples of models deployable with Truss ☆214 · Updated this week
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU) ☆231 · Updated 2 years ago