cansik / onnxruntime-silicon
ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64)
☆196Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for onnxruntime-silicon
- ☆50Updated last year
- Python bindings for ggml☆132Updated 2 months ago
- 🐍 | Python library for RunPod API and serverless worker SDK.☆180Updated 2 weeks ago
- A ggml (C++) re-implementation of tortoise-tts☆155Updated 2 months ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆45Updated last year
- ONNX implementation of Whisper. PyTorch free.☆84Updated 2 months ago
- A set of custom nodes for ComfyUI that allow you to use Core ML models in your ComfyUI workflows.☆128Updated 2 months ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆167Updated this week
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.☆55Updated last year
- LLaVA server (llama.cpp).☆177Updated last year
- FlashAttention (Metal Port)☆382Updated last month
- MLX Stable Diffusion WebUI for Apple MLX Stable Diffusion example code.☆95Updated 6 months ago
- Deploy stable diffusion model with onnx/tenorrt + tritonserver☆123Updated last year
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆229Updated 6 months ago
- For inferring and serving local LLMs using the MLX framework☆89Updated 7 months ago
- Local ML voice chat using high-end models.☆144Updated this week
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆71Updated 8 months ago
- The community repository for the Draw Things app.☆93Updated this week
- Optimum version of a UI for Stable Diffusion, running on ONNX models for faster inference, working on most common GPU vendors: NVIDIA,AMD…☆22Updated 10 months ago
- Python bindings for whisper.cpp☆216Updated 5 months ago
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 2 months ago
- A curated list of amazing RunPod projects, libraries, and resources☆102Updated 2 months ago
- ☆129Updated 10 months ago
- Distributed Inference for mlx LLm☆68Updated 3 months ago
- Diffusion Animation Toolkit☆33Updated last year
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆46Updated last year
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆166Updated 6 months ago
- Falcon LLM ggml framework with CPU and GPU support☆244Updated 9 months ago
- Port of Suno AI's Bark in C/C++ for fast inference☆54Updated 6 months ago
- Pybind11 bindings for Whisper.cpp☆324Updated this week