okuvshynov / cubestat
Horizon chart for CPU/GPU/Neural Engine utilization monitoring on Apple M1/M2 and NVIDIA GPUs on Linux
☆25 Updated last month
Alternatives and similar repositories for cubestat
Users who are interested in cubestat are comparing it to the repositories listed below
- llm plugin for Cerebras fast inference API ☆26 Updated 2 months ago
- 🛠 Self-hosted, fast, and consistent remote configuration for apps. ☆15 Updated 2 years ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers ☆26 Updated 2 weeks ago
- Smart reproducible analytical pipeline inspection ☆17 Updated last month
- First token cutoff sampling inference example ☆29 Updated last year
- A super simple web interface to perform blind tests on LLM outputs. ☆28 Updated last year
- Implementation of nougat that focuses on processing pdf locally. ☆81 Updated 4 months ago
- A text-to-SQL prototype on the northwind sqlite dataset ☆12 Updated 8 months ago
- Transformer GPU VRAM estimator ☆64 Updated last year
- GraphRag vs Embeddings ☆13 Updated 10 months ago
- Exploration of Vector database Index for fast approximate nearest neighbour search. ☆25 Updated 10 months ago
- All-in-Storage Solution based on DiskANN for DRAM-free Approximate Nearest Neighbor Search ☆57 Updated 4 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs ☆82 Updated 10 months ago
- Simple high-throughput inference library ☆115 Updated 3 weeks ago
- Vector Embedding Server in under 100 lines of code ☆22 Updated last year
- Run transformers (incl. LLMs) on the Apple Neural Engine. ☆61 Updated last year
- A minimalistic C++ Jinja templating engine for LLM chat templates ☆153 Updated 3 weeks ago
- Concatenated documentation for use with LLMs ☆36 Updated last week
- Roberta Question Answering using MLX. ☆24 Updated last year
- LLama implementations benchmarking framework ☆12 Updated last year
- Create embeddings for LLM using the Nomic API ☆23 Updated 6 months ago
- Tools for formatting large language model prompts. ☆13 Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model ☆18 Updated last year
- convert natural language into technical diagrams ☆14 Updated 5 months ago
- A CLI tool for managing OpenAI batch processing jobs with ease. ☆36 Updated last month
- A python command-line tool to download & manage MLX AI models from Hugging Face. ☆17 Updated 9 months ago
- Run Llama 2 using MLX on macOS ☆34 Updated last year
- Shared personal notes created while working with the Apple MLX machine learning framework ☆24 Updated 2 weeks ago
- A command-line utility to manage MLX models between your Hugging Face cache and LM Studio. ☆38 Updated 3 months ago
- A simple GitHub Actions script that builds a llamafile and uploads it to Hugging Face ☆14 Updated last year