Let's have some retro gaming fun with AI! Join the discord: https://discord.gg/5xXzkMu8Zk
☆71Nov 19, 2025Updated 4 months ago
Alternatives and similar repositories for infinity-arcade
Users that are interested in infinity-arcade are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Generate a llama-quantize command to copy the quantization parameters of any GGUF☆31Jan 23, 2026Updated 2 months ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated last year
- Your models on any xPU☆53Updated this week
- ☆11Jan 28, 2024Updated 2 years ago
- ☆14Aug 25, 2024Updated last year
- Swift package for reading and writing Safetensors files.☆12Feb 6, 2026Updated last month
- Oobabooga "Hello World" API example for node.js with Express☆13Jul 2, 2023Updated 2 years ago
- ☆10May 2, 2023Updated 2 years ago
- ☆17Dec 16, 2024Updated last year
- ☆27Jun 7, 2024Updated last year
- Qwen3-TTS, Apple MLX, WebUI, API Server☆36Feb 12, 2026Updated last month
- Implementation of stop sequencer for Huggingface Transformers☆16Jun 6, 2023Updated 2 years ago
- ☆19Jun 5, 2023Updated 2 years ago
- A MacOS application showcasing DeepSeek's R1 Distill Qwen 1.5B LLM running locally with MLX Model Manager☆17Jan 20, 2025Updated last year
- A tool which checks compatibility of CoreML model with Apple Neural Engine☆14May 30, 2022Updated 3 years ago
- ☆14Jul 2, 2024Updated last year
- A guide to testing different runpod (and other linux VMs) configurations. Specifically the speed of LLM outputs☆17Jan 12, 2024Updated 2 years ago
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated last year
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub.☆13Mar 16, 2026Updated last week
- A tool that can be used to measure the sequential performance of any OpenAI-compatible LLM API☆23Aug 1, 2024Updated last year
- Submission to the inverse scaling prize☆23Jul 23, 2023Updated 2 years ago
- ☆16Jul 13, 2024Updated last year
- ☆41Feb 14, 2026Updated last month
- ☆21Dec 5, 2024Updated last year
- DirectDraw HAL implementation for VMDisp9x driver☆13Oct 18, 2025Updated 5 months ago
- The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge☆25Mar 5, 2025Updated last year
- Friendly Terminal Assistant for Developers☆17Mar 23, 2024Updated 2 years ago
- A Swift Wrapper for PyTorch and Torchvision.☆14Jul 19, 2019Updated 6 years ago
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model.☆25Oct 23, 2025Updated 4 months ago
- Benchmarking tool for vLLM inference performance with GPU monitoring☆42Nov 24, 2025Updated 3 months ago
- Open source MCP server for Vectara☆26Dec 5, 2025Updated 3 months ago
- VDO.Ninja module for Companion☆13Mar 12, 2026Updated last week
- Tune-Mode ConvBN Blocks For Efficient Transfer Learning☆18Aug 1, 2023Updated 2 years ago
- This Repo Help U Get The Information Of Telegram Account By Telethon String Session☆17Feb 6, 2022Updated 4 years ago
- KDE integration for gphoto2 cameras☆13Mar 13, 2026Updated last week
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- Building synthetic data for preference tuning☆27Dec 26, 2024Updated last year
- An open-source platform for software vendors to deploy and operate their software in their customers' cloud accounts. aka Bring Your Own …☆83Updated this week
- Multilingual Knowledge Graph Enhancement (EMNLP 2023)☆24Nov 28, 2023Updated 2 years ago