groq / groqflowLinks
GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing those programs on GroqChip™ processors.
☆114Updated 4 months ago
Alternatives and similar repositories for groqflow
Users that are interested in groqflow are comparing it to the libraries listed below
Sorting:
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 4 months ago
- Tutorial to get started with SkyPilot!☆58Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆53Updated 2 years ago
- 🏥 Health monitor for a Petals swarm☆40Updated last year
- Command line tool for Deep Infra cloud ML inference service☆33Updated last year
- Transformer GPU VRAM estimator☆67Updated last year
- ☆89Updated last year
- ☆125Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆94Updated last week
- ☆36Updated last year
- Inference examples☆65Updated 3 months ago
- ScalarLM - a unified training and inference stack☆93Updated last month
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆89Updated 2 weeks ago
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"☆148Updated 2 years ago
- 1.58-bit LLaMa model☆83Updated last year
- 1.58 Bit LLM on Apple Silicon using MLX☆230Updated last year
- ☆112Updated 2 years ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated 2 years ago
- LMQL implementation of tree of thoughts☆35Updated last year
- ☆164Updated 10 months ago
- Roy: A lightweight, model-agnostic framework for crafting advanced multi-agent systems using large language models.☆78Updated 2 years ago
- Fast parallel LLM inference for MLX☆238Updated last year
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆48Updated 2 months ago
- Repository of model demos using TT-Buda☆63Updated 8 months ago
- ☆142Updated 2 years ago
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆79Updated 11 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- Run language models on consumer hardware.☆27Updated 2 years ago
- Making the world's first and smartest opensource any-to-any AGI system☆44Updated last month
- Pipeline is an open source python SDK for building AI/ML workflows☆138Updated last year