okuvshynov / cubestat
Horizon chart for CPU/GPU/Neural Engine utilization monitoring on Apple M1/M2 and nVidia GPUs on Linux
☆23Updated 3 months ago
Alternatives and similar repositories for cubestat:
Users that are interested in cubestat are comparing it to the libraries listed below
- A super simple web interface to perform blind tests on LLM outputs.☆27Updated 10 months ago
- llm plugin for Cerebras fast inference API☆18Updated 3 months ago
- Exploration of Vector database Index for fast approximate nearest neighbour search.☆17Updated 5 months ago
- Benchmark that evaluates LLMs using 436 NYT Connections puzzles☆12Updated this week
- Benchmarking suite for popular AI APIs☆80Updated 2 months ago
- Vector Embedding Server in under 100 lines of code☆22Updated 10 months ago
- A fork of llama3.c used to do some R&D on inferencing☆17Updated last month
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆17Updated last year
- ☆27Updated 9 months ago
- LLM code editor for backend services☆14Updated 3 months ago
- Finetune your embeddings in-browser☆32Updated 9 months ago
- Roberta Question Answering using MLX.☆23Updated last year
- Test server code for Phi-2 model. support OpenAI API spec☆17Updated last year
- Scalable Embedded Vector Index for Go and Rust☆35Updated 2 months ago
- 🛠 Self-hosted, fast, and consistent remote configuration for apps.☆14Updated 2 years ago
- Shared personal notes created while working with the Apple MLX machine learning framework☆21Updated 7 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆70Updated last month
- GraphRag vs Embeddings☆13Updated 6 months ago
- Download models from the Ollama library, without Ollama☆49Updated 2 months ago
- ☆52Updated last year
- A Python framework for building AI agent systems with robust task management in the form of a graph execution engine, inference capabilit…☆21Updated last week
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆68Updated 6 months ago
- xargs for semgrep☆22Updated 10 months ago
- convert natural language into technical diagrams☆12Updated last month
- Go bindings for LLama.cpp☆12Updated last year
- FalkorDB-Browser is a visualization UI for FalkorDB.☆24Updated this week
- A library for incremental loading of large PyTorch checkpoints☆56Updated last year
- Transformer GPU VRAM estimator☆45Updated 10 months ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers☆13Updated 2 weeks ago
- A SQLite extension for generating text embeddings from remote APIs (OpenAI, Nomic, Ollama, llamafile...)☆103Updated 2 months ago