groq / groqflow

GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing those programs on GroqChip™ processors.

☆114

Alternatives and similar repositories for groqflow

Users who are interested in groqflow are comparing it to the libraries listed below.

groq / mlagility
Machine Learning Agility (MLAgility) benchmark and benchmarking tools
☆40 Updated 3 months ago
deepsilicon / Sila
☆89 Updated last year
skypilot-org / skypilot-tutorial
Tutorial to get started with SkyPilot!
☆57 Updated last year
mistralai / vllm-release
A high-throughput and memory-efficient inference and serving engine for LLMs
☆53 Updated last year
rafacelente / bllama
1.58-bit LLaMa model
☆83 Updated last year
EmbeddedLLM / vllm
vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
☆93 Updated this week
katsumiok / pyaskit
AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)
☆79 Updated 10 months ago
exo-explore / mlx-bitnet
1.58 Bit LLM on Apple Silicon using MLX
☆225 Updated last year
tairov / QStarLearning.mojo
☆112 Updated last year
glaive-ai / function-calling-server
☆36 Updated last year
tenstorrent / tt-smi
Tenstorrent console based hardware information program
☆57 Updated this week
QuixiAI / kraken
☆67 Updated last year
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31 Updated last year
furiousteabag / vram-calculator
Transformer GPU VRAM estimator
☆66 Updated last year
teknium1 / ShareGPT-Builder
☆116 Updated 11 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63 Updated last year
Liquid4All / liquid_client
☆29 Updated last year
Alignment-Lab-AI / Our-Projects
A repository of projects and datasets under active development by Alignment Lab AI
☆22 Updated last year
N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchical branching pathways of LLM generations.
☆72 Updated 9 months ago
willccbb / mlx_parallm
Fast parallel LLM inference for MLX
☆232 Updated last year
petals-infra / health.petals.dev
🏥 Health monitor for a Petals swarm
☆39 Updated last year
unslothai / llama.cpp
LLM inference in C/C++
☆102 Updated 2 weeks ago
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆102 Updated last year
kenshin9000 / ConceptARC-Representations
This repository explains and provides examples for "concept anchoring" in GPT4.
☆71 Updated last year
The-Swarm-Corporation / swarms-cloud
Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.
☆47 Updated last month
nicholasyager / llama-cpp-guidance
A guidance compatibility layer for llama-cpp-python
☆36 Updated 2 years ago
LachlanGray / lmql-tree-of-thoughts
LMQL implementation of tree of thoughts
☆34 Updated last year
normal-computing / extended-mind-transformers
☆124 Updated last year
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58 Updated last year
tensorwavecloud / ScalarLM
ScalarLM - a unified training and inference stack
☆93 Updated this week