experiments with MLX
☆68Dec 15, 2025Updated 3 months ago
Alternatives and similar repositories for mlx-rdma
Users that are interested in mlx-rdma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆26Oct 7, 2025Updated 5 months ago
- grep for context, not just text. Local-first CLI for searching documents, notes, memories, and project context.☆23Mar 8, 2026Updated 2 weeks ago
- matmul using AMX instructions☆23May 7, 2024Updated last year
- High-performance KV cache storage for LLM inference — GPU offloading, SSD caching, and cross-node sharing via RDMA. Works with vLLM and S…☆27Updated this week
- ☆48Jan 3, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Realtime Transcription with Voxtral in MLX☆90Feb 8, 2026Updated last month
- ☆49Mar 17, 2026Updated last week
- BH hackathon☆14Apr 4, 2024Updated last year
- Recover your DeSo seed phrase☆11Apr 13, 2022Updated 3 years ago
- ☆19Aug 23, 2025Updated 7 months ago
- A DataFusion-powered Serverless S3 Proxy.☆17Apr 15, 2024Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Mar 13, 2023Updated 3 years ago
- load-balancer-algorithm by go☆63Mar 25, 2025Updated last year
- OpenSource deployment made easy☆10Jun 13, 2015Updated 10 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Notes for Computer Networks Course☆12Jan 14, 2018Updated 8 years ago
- 📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual b…☆44Mar 16, 2026Updated last week
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆93Jan 23, 2026Updated 2 months ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16May 8, 2025Updated 10 months ago
- Flexible memory allocation tool for multi-tiered memory systems☆13Jan 7, 2026Updated 2 months ago
- A MIPS CPU with dual-issue, out-of-order, and 5-stage pipelines☆11Nov 28, 2019Updated 6 years ago
- Linux kernel hooking library☆21May 23, 2020Updated 5 years ago
- Cluster simulator with far memory☆12Apr 28, 2020Updated 5 years ago
- ☆13Jan 7, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- MLX binary vectors and associated algorithms.☆14Mar 13, 2025Updated last year
- 该项目收集了 2000+ 高质量、开箱即用 的 n8n 自动工作流模板,涵盖官方示例、社区精华和用户实用场景。它内置一个基于 FastAPI 的本地搜索服务,支持全文搜索、分类筛选和 Mermaid 可视化展示,可一键下载 JSON 文件,方便导入你的 n8n 实例。all …☆31Sep 7, 2025Updated 6 months ago
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆30Jan 22, 2026Updated 2 months ago
- 'afm' command cli: macOS server and single prompt mode that exposes Apple's Foundation and MLX Models and other APIs running on your Mac …☆207Updated this week
- ☆15Feb 23, 2026Updated last month
- ☆23Mar 21, 2025Updated last year
- The end of Screenshot 2023-12-20-21.11.59.png☆15Dec 22, 2023Updated 2 years ago
- REAP expert pruning for MoE LLMs on Apple Silicon via MLX☆49Mar 16, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A React app to parse and play media from subreddits and comment threads☆14Mar 3, 2016Updated 10 years ago
- Ultra-minimal autoregressive diffusion model for image generation☆21Dec 26, 2025Updated 3 months ago
- ☆19Feb 11, 2026Updated last month
- generalized rust interface for subnets.☆22Nov 21, 2024Updated last year
- Rust implementation of Needleman-Wunsch & Smith-Waterman sequence alignment☆22Jun 25, 2025Updated 9 months ago
- Use hardware performance counters to find mapping of addresses to L3 slices in Intel processors☆18Jul 30, 2023Updated 2 years ago
- FastMLX is a high performance production ready API to host MLX models.☆347Mar 18, 2025Updated last year