Herdora / chiselLinks
CLI tool for developing and profiling GPU kernels locally. Just write, test, and profile GPU code from your laptop.
☆35Updated this week
Alternatives and similar repositories for chisel
Users that are interested in chisel are comparing it to the libraries listed below
Sorting:
- A Python library for dynamic dispatch based on module versions and backends.☆54Updated last month
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆47Updated this week
- A snappy + easy + pretty TUI debugger for Python.☆48Updated last month
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆62Updated 3 months ago
- ML/DL Math and Method notes☆61Updated last year
- PyTorch centric eager mode debugger☆47Updated 7 months ago
- LLM training in simple, raw C/CUDA☆99Updated last year
- Because it's there.☆16Updated 9 months ago
- Personal solutions to the Triton Puzzles☆19Updated last year
- A high-performance library for compressed ndarrays, with a flexible computational engine☆158Updated this week
- ☆60Updated 3 years ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆116Updated this week
- Notebooks for the "Deep Learning with JAX" book☆151Updated last month
- Playing around "Less Slow" coding practices in Python, from numerical micro-kernels to coroutines, ranges, and polymorphic state machines☆114Updated 3 months ago
- Documented and Unit Tested educational Deep Learning framework with Autograd from scratch.☆117Updated last year
- ☆21Updated 4 months ago
- Better bindings for Python☆17Updated 2 years ago
- Simplified implementation of UMAP like dimensionality reduction algorithm☆49Updated 8 months ago
- ☆12Updated last week
- The CUDA target for Numba☆149Updated last week
- Meta-GPU lesson covering general aspects of GPU programming as well as specific frameworks☆88Updated 2 months ago
- Quadra: Effortless and reproducible deep learning workflows with configuration files.☆49Updated last month
- Learning about CUDA by writing PTX code.☆133Updated last year
- Write your code as tree-like expressions, then transform it☆21Updated last year
- NVIDIA Math Libraries for the Python Ecosystem☆333Updated last week
- Competitive GPU kernel optimization platform.☆86Updated this week
- Rust Implementation of micrograd☆52Updated last year
- FAST Randomized SVD on a GPU with CUDA 🏎️☆12Updated 6 years ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆47Updated 3 months ago
- Parallel Computing starter project to build GPU & CPU kernels in CUDA & C++ and call them from Python without a single line of CMake usin…☆26Updated 4 months ago