okuvshynov / cubestat
Horizon chart for CPU/GPU/Neural Engine utilization monitoring on Apple M1/M2 and nVidia GPUs on Linux
☆24Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for cubestat
- A super simple web interface to perform blind tests on LLM outputs.☆26Updated 8 months ago
- llm plugin for Cerebras fast inference API☆18Updated 2 weeks ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆47Updated 3 months ago
- Implementation of nougat that focuses on processing pdf locally.☆73Updated 6 months ago
- Prompt-based software development☆21Updated 2 months ago
- A CLI tool for managing OpenAI batch processing jobs with ease.☆26Updated 2 months ago
- Vector Embedding Server in under 100 lines of code☆22Updated 8 months ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆17Updated 10 months ago
- LLM benchmark: Generate an SVG of a pelican riding a bicycle☆31Updated 2 weeks ago
- First token cutoff sampling inference example☆28Updated 9 months ago
- Run embedding models using ONNX☆23Updated 9 months ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆53Updated 6 months ago
- Columnar database on SSD NVMe☆13Updated 3 years ago
- The fastest ACID-transactional persisted Key-Value store designed as modified LSM-Tree for NVMe block-devices with GPU-acceleration and S…☆57Updated last year
- Shared personal notes created while working with the Apple MLX machine learning framework☆19Updated 4 months ago
- GraphRag vs Embeddings☆13Updated 3 months ago
- A star for organising blocks and playing with transformers.☆25Updated 6 months ago
- Run transformers (incl. LLMs) on the Apple Neural Engine.☆52Updated 11 months ago
- Finetune your embeddings in-browser☆31Updated 6 months ago
- LLM-Powered Analyses of your GitHub Community using EvaDB☆22Updated last year
- ☆15Updated 10 months ago
- Roberta Question Answering using MLX.☆21Updated 10 months ago
- Some tough questions to test new models.☆26Updated 6 months ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated last year
- Distributed Inference for mlx LLm☆68Updated 3 months ago
- ☆38Updated 7 months ago
- C API for MLX☆75Updated last month
- ☆96Updated 2 months ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆50Updated 3 months ago
- LLama implementations benchmarking framework☆12Updated last year