okuvshynov / cubestatLinks
Horizon chart for CPU/GPU/Neural Engine utilization monitoring. Supports Apple M1-M4, Nvidia GPUs, AMD GPUs
☆26Updated 4 months ago
Alternatives and similar repositories for cubestat
Users that are interested in cubestat are comparing it to the libraries listed below
Sorting:
- ☆21Updated 2 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆91Updated last year
- Transformer GPU VRAM estimator☆67Updated last year
- Spying on Apple’s new predictive text model☆133Updated last year
- Implementation of nougat that focuses on processing pdf locally.☆83Updated 11 months ago
- GGUF implementation in C as a library and a tools CLI program☆296Updated 3 months ago
- An implementation of bucketMul LLM inference☆223Updated last year
- First token cutoff sampling inference example☆31Updated last year
- 1.58 Bit LLM on Apple Silicon using MLX☆226Updated last year
- Simple high-throughput inference library☆151Updated 7 months ago
- A super simple web interface to perform blind tests on LLM outputs.☆29Updated last year
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆53Updated 3 months ago
- C API for MLX☆155Updated last week
- Benchmarking suite for popular AI APIs☆88Updated 10 months ago
- Pivotal Token Search☆134Updated this week
- Rust crates for XetHub☆75Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Updated last year
- Heirarchical Navigable Small Worlds☆101Updated 4 months ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆223Updated 3 weeks ago
- A collection of reproducible inference engine benchmarks☆38Updated 7 months ago
- LLM plugin for clustering embeddings☆82Updated last year
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- ☆138Updated 2 years ago
- ☆219Updated 10 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates☆200Updated 2 months ago
- Run models distributed as GGUF files using LLM☆81Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆32Updated 11 months ago
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆129Updated last month
- A playground to make it easy to try crazy things☆33Updated 2 weeks ago
- ☆115Updated 10 months ago