mengwanguc / gpemuLinks
GPEmu, a GPU emulator for faster and cheaper prototyping and evaluation of deep learning system research
☆27Updated 8 months ago
Alternatives and similar repositories for gpemu
Users that are interested in gpemu are comparing it to the libraries listed below
Sorting:
- Tensor library & inference framework for machine learning☆106Updated 3 weeks ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 3 months ago
- Pivotal Token Search☆119Updated 3 weeks ago
- xet client tech, used in huggingface_hub☆157Updated this week
- ☆196Updated 3 months ago
- The Engineer's Guide to Deep-Learning☆37Updated 6 months ago
- tiny code to access tenstorrent blackhole☆58Updated 2 months ago
- ☆249Updated last year
- Samples of good AI generated CUDA kernels☆88Updated 2 months ago
- Repository for the QUIK project, enabling the use of 4bit kernels for generative inference - EMNLP 2024☆181Updated last year
- ☆392Updated last week
- ☆163Updated last year
- Standalone commandline CLI tool for compiling Triton kernels☆17Updated 10 months ago
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…☆218Updated last week
- A playground to make it easy to try crazy things☆33Updated last month
- Gradual typing for tensor shapes in Rust☆77Updated last month
- Lightweight Llama 3 8B Inference Engine in CUDA C☆47Updated 4 months ago
- Heirarchical Navigable Small Worlds☆101Updated this week
- First token cutoff sampling inference example☆30Updated last year
- Make triton easier☆47Updated last year
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆211Updated last year
- Algebraic enhancements for GEMM & AI accelerators☆278Updated 5 months ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆105Updated last year
- Because it's there.☆16Updated 10 months ago
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆372Updated last year
- a small, lightweight crate for numerical integration written in Rust.☆107Updated 2 weeks ago
- Inference of Mamba models in pure C☆190Updated last year
- Official Rust Implementation of Model2Vec☆123Updated last month
- time to learn mlx☆40Updated 2 months ago
- throwaway GPT inference☆140Updated last year