groq / groqflow
GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing those programs on GroqChip™ processors.
☆109Updated last month
Alternatives and similar repositories for groqflow:
Users that are interested in groqflow are comparing it to the libraries listed below
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated last month
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 10 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆68Updated 2 months ago
- look how they massacred my boy☆63Updated 6 months ago
- Transformer GPU VRAM estimator☆59Updated last year
- ☆66Updated 10 months ago
- ☆30Updated last month
- ☆89Updated 6 months ago
- This repository explains and provides examples for "concept anchoring" in GPT4.☆72Updated last year
- LMQL implementation of tree of thoughts☆34Updated last year
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated last year
- papers.day☆93Updated last year
- AI Assistant running within your browser.☆62Updated 4 months ago
- A Learning Journey: Micrograd in Mojo 🔥☆61Updated 6 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated 11 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆86Updated this week
- Tutorial to get started with SkyPilot!☆57Updated 11 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆67Updated 4 months ago
- ☆53Updated 11 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated 11 months ago
- 1.58 Bit LLM on Apple Silicon using MLX☆195Updated 11 months ago
- ☆22Updated last year
- Run language models on consumer hardware.☆25Updated last year
- The next evolution of Agents☆48Updated last week
- Write a fast kernel and run it on Discord. See how you compare against the best!☆38Updated this week
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"☆137Updated last year
- ☆117Updated 8 months ago
- RAG example using DSPy, Gradio, FastAPI☆78Updated last year
- 1.58-bit LLaMa model☆81Updated last year
- ☆48Updated last year