groq / groqflowLinks
GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing those programs on GroqChip™ processors.
☆114Updated 4 months ago
Alternatives and similar repositories for groqflow
Users that are interested in groqflow are comparing it to the libraries listed below
Sorting:
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 4 months ago
- Tutorial to get started with SkyPilot!☆58Updated last year
- ☆36Updated last year
- ☆89Updated last year
- Command line tool for Deep Infra cloud ML inference service☆33Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- Inference examples☆63Updated 2 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- 🏥 Health monitor for a Petals swarm☆39Updated last year
- Run language models on consumer hardware.☆27Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆53Updated 2 years ago
- ScalarLM - a unified training and inference stack☆94Updated 3 weeks ago
- 1.58 Bit LLM on Apple Silicon using MLX☆226Updated last year
- ☆124Updated last year
- inference code for mixtral-8x7b-32kseqlen☆104Updated 2 years ago
- look how they massacred my boy☆63Updated last year
- A framework for generative software.☆114Updated 5 months ago
- Continuously learning web-browsing AI agent that extends the Voyager architecture.☆40Updated 6 months ago
- Roy: A lightweight, model-agnostic framework for crafting advanced multi-agent systems using large language models.☆79Updated 2 years ago
- Estimating hardware and cloud costs of LLMs and transformer projects☆20Updated last month
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆93Updated this week
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated last year
- ☆68Updated last year
- ☆112Updated 2 years ago
- Public repository containing METR's DVC pipeline for eval data analysis☆143Updated 8 months ago
- Repository of model demos using TT-Buda☆63Updated 8 months ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆48Updated 2 months ago
- 1.58-bit LLaMa model☆83Updated last year
- Fast parallel LLM inference for MLX☆234Updated last year
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆79Updated 11 months ago