groq / groqflowLinks
GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing those programs on GroqChip™ processors.
☆115Updated 6 months ago
Alternatives and similar repositories for groqflow
Users that are interested in groqflow are comparing it to the libraries listed below
Sorting:
- Transformer GPU VRAM estimator☆68Updated last year
- ☆91Updated last year
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 6 months ago
- 1.58-bit LLaMa model☆82Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆93Updated this week
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆80Updated last year
- 🏥 Health monitor for a Petals swarm☆40Updated last year
- Repository of model demos using TT-Buda☆63Updated 10 months ago
- Tutorial to get started with SkyPilot!☆58Updated last year
- ☆125Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆53Updated 2 years ago
- 1.58 Bit LLM on Apple Silicon using MLX☆243Updated last year
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆50Updated 3 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- Roy: A lightweight, model-agnostic framework for crafting advanced multi-agent systems using large language models.☆78Updated 2 years ago
- ☆119Updated last year
- ☆122Updated last year
- ScalarLM - a unified training and inference stack☆97Updated 2 months ago
- ☆162Updated last year
- Route LLM requests to the best model for the task at hand.☆177Updated 3 weeks ago
- Tenstorrent console based hardware information program☆58Updated this week
- GRDN.AI app for garden optimization☆69Updated 2 months ago
- ☆68Updated last year
- ☆112Updated 2 years ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆132Updated last year
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆24Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆34Updated last year
- Fast parallel LLM inference for MLX☆246Updated last year
- inference code for mixtral-8x7b-32kseqlen☆105Updated 2 years ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆90Updated last month