cansik / onnxruntime-siliconLinks
ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64)
☆227Updated last year
Alternatives and similar repositories for onnxruntime-silicon
Users that are interested in onnxruntime-silicon are comparing it to the libraries listed below
Sorting:
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆233Updated last month
- Python bindings for ggml☆146Updated last year
- ☆57Updated 2 years ago
- FRP Fork☆176Updated 8 months ago
- LLaVA server (llama.cpp).☆183Updated 2 years ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆48Updated 2 years ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆163Updated 7 months ago
- Use safetensors with ONNX 🤗☆77Updated 2 months ago
- On-device Image Generation for Apple Silicon☆674Updated 8 months ago
- 🐍 | Python library for RunPod API and serverless worker SDK.☆258Updated this week
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆226Updated 2 years ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- Pybind11 bindings for Whisper.cpp☆342Updated last year
- stable-diffusion.cpp bindings for python☆82Updated this week
- Local ML voice chat using high-end models.☆178Updated last month
- Port of Suno AI's Bark in C/C++ for fast inference☆52Updated last year
- Prebuilt Google MediaPipe packages for arm64.☆80Updated 2 years ago
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.☆61Updated 6 months ago
- 🐳 | Dockerfiles for the RunPod container images used for our official templates.☆213Updated last month
- Extend the original llama.cpp repo to support redpajama model.☆118Updated last year
- openvino version of openai/whisper☆178Updated 2 years ago
- A PoC to run Segment Anything Model (SAM) entirely in the browser without any backend☆77Updated 2 years ago
- Examples of models deployable with Truss☆212Updated last week
- CLIP inference in plain C/C++ with no extra dependencies☆542Updated 5 months ago
- Falcon LLM ggml framework with CPU and GPU support☆248Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆193Updated last year
- FlashAttention (Metal Port)☆560Updated last year
- Port of Microsoft's BioGPT in C/C++ using ggml☆85Updated last year
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆275Updated last month