groq / groqflowLinks
GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing those programs on GroqChip™ processors.
☆109Updated 3 weeks ago
Alternatives and similar repositories for groqflow
Users that are interested in groqflow are comparing it to the libraries listed below
Sorting:
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated 3 weeks ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆60Updated last week
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆86Updated this week
- ☆89Updated 8 months ago
- LMQL implementation of tree of thoughts☆34Updated last year
- Tutorial to get started with SkyPilot!☆57Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best!☆44Updated this week
- inference code for mixtral-8x7b-32kseqlen☆100Updated last year
- GRDN.AI app for garden optimization☆70Updated last year
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆78Updated 4 months ago
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- Samples of good AI generated CUDA kernels☆65Updated last week
- ☆68Updated 3 months ago
- Command line tool for Deep Infra cloud ML inference service☆30Updated 11 months ago
- AI Assistant running within your browser.☆67Updated 6 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆69Updated 3 months ago
- LLM inference in C/C++☆77Updated 3 weeks ago
- 1.58-bit LLaMa model☆81Updated last year
- ☆72Updated last year
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆42Updated 3 months ago
- Making the world's first and smartest opensource any-to-any AGI system☆41Updated this week
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆53Updated last year
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated last year
- Modded vLLM to run pipeline parallelism over public networks☆35Updated 2 weeks ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆50Updated last year
- Transformer GPU VRAM estimator☆64Updated last year
- Ongoing research training transformer models at scale☆37Updated last year
- Cray-LM unified training and inference stack.☆22Updated 4 months ago