groq / groqflow
GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing those programs on GroqChip™ processors.
☆110Updated last week
Alternatives and similar repositories for groqflow
Users who are interested in groqflow are comparing it to the libraries listed below.
- Tutorial to get started with SkyPilot!☆58Updated last year
- ☆123Updated last year
- Route LLM requests to the best model for the task at hand.☆87Updated last month
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated last week
- ☆89Updated 10 months ago
- ☆111Updated last year
- inference code for mixtral-8x7b-32kseqlen☆101Updated last year
- Command line tool for Deep Infra cloud ML inference service☆32Updated last year
- 1.58 Bit LLM on Apple Silicon using MLX☆217Updated last year
- A framework for generative software.☆113Updated last month
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆87Updated this week
- ☆121Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆53Updated last year
- Distributed inference for MLX LLMs☆94Updated last year
- ☆160Updated 5 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- 🏥 Health monitor for a Petals swarm☆39Updated last year
- ☆66Updated last year
- A tree-based prefix cache library that allows rapid creation of looms: hierarchical branching pathways of LLM generations.☆72Updated 5 months ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers☆30Updated this week
- Fast parallel LLM inference for MLX☆204Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis☆86Updated 4 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- GRDN.AI app for garden optimization☆70Updated last year
- ☆63Updated 7 months ago
- Run language models on consumer hardware.☆27Updated last year
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆42Updated 2 weeks ago
- Inference examples☆55Updated 5 months ago
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆178Updated 3 months ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆55Updated last year