groq / groqflowLinks
GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing those programs on GroqChip™ processors.
☆112Updated last month
Alternatives and similar repositories for groqflow
Users that are interested in groqflow are comparing it to the libraries listed below
Sorting:
- ☆89Updated 11 months ago
- Tutorial to get started with SkyPilot!☆58Updated last year
- 1.58-bit LLaMa model☆82Updated last year
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated last month
- ☆121Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆52Updated last year
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆79Updated 8 months ago
- ☆123Updated last year
- ☆116Updated 9 months ago
- look how they massacred my boy☆64Updated 11 months ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers☆84Updated this week
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure☆50Updated 11 months ago
- inference code for mixtral-8x7b-32kseqlen☆101Updated last year
- 🏥 Health monitor for a Petals swarm☆39Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆89Updated this week
- ☆63Updated 8 months ago
- This repository explains and provides examples for "concept anchoring" in GPT4.☆71Updated last year
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated 2 years ago
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"☆143Updated last year
- ☆36Updated last year
- Estimating hardware and cloud costs of LLMs and transformer projects☆18Updated 2 months ago
- 1.58 Bit LLM on Apple Silicon using MLX☆223Updated last year
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆56Updated last year
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆39Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- Transformer GPU VRAM estimator☆66Updated last year
- Repository of model demos using TT-Buda☆62Updated 5 months ago
- Fast parallel LLM inference for MLX☆217Updated last year
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆120Updated last week