groq / groqflow
GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing those programs on GroqChip™ processors.
☆112 Updated 3 weeks ago
Alternatives and similar repositories for groqflow
Users who are interested in groqflow are comparing it to the libraries listed below.
- 1.58 Bit LLM on Apple Silicon using MLX ☆221 Updated last year
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools ☆39 Updated 3 weeks ago
- Route LLM requests to the best model for the task at hand. ☆96 Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆52 Updated last year
- LMQL implementation of tree of thoughts ☆34 Updated last year
- Tutorial to get started with SkyPilot! ☆58 Updated last year
- ☆89 Updated 10 months ago
- 🏥 Health monitor for a Petals swarm ☆39 Updated last year
- ☆36 Updated last year
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations. ☆73 Updated 6 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆37 Updated last year
- ☆123 Updated last year
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench ☆180 Updated 3 months ago
- Transformer GPU VRAM estimator ☆66 Updated last year
- inference code for mixtral-8x7b-32kseqlen ☆101 Updated last year
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing. ☆45 Updated last week
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 Updated last year
- ScalarLM - a unified training and inference stack ☆55 Updated 3 weeks ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆88 Updated last week
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆58 Updated last year
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX ☆56 Updated last year
- Run language models on consumer hardware. ☆28 Updated last year
- Inference examples ☆56 Updated 6 months ago
- look how they massacred my boy ☆64 Updated 10 months ago
- Public repository containing METR's DVC pipeline for eval data analysis ☆96 Updated 4 months ago
- Fast parallel LLM inference for MLX ☆206 Updated last year
- 1.58-bit LLaMa model ☆82 Updated last year
- A Learning Journey: Micrograd in Mojo 🔥 ☆62 Updated 10 months ago
- LLM finetuning ☆42 Updated 2 years ago
- ☆63 Updated 8 months ago