bwasti / better_bindings
Better bindings for Python
☆17 Updated last year
Related projects
Alternatives and complementary repositories for better_bindings
- A thin, highly portable toolkit for efficiently compiling dense loop-based computation. ☆147 Updated last year
- Loop Nest - Linear algebra compiler and code generator. ☆22 Updated 2 years ago
- A tracing JIT compiler for PyTorch ☆12 Updated 2 years ago
- Write your code as tree-like expressions, then transform it ☆21 Updated 10 months ago
- An Aspiring Drop-In Replacement for Pandas at Scale ☆73 Updated 3 years ago
- ☆58 Updated 2 years ago
- A small library for creating and manipulating custom JAX Pytree classes ☆58 Updated last year
- Unifying Python/C++/CUDA memory: Python buffered array ↔️ `std::vector` ↔️ CUDA managed memory ☆81 Updated last week
- Make triton easier ☆41 Updated 4 months ago
- PyTorch centric eager mode debugger ☆43 Updated 4 months ago
- Data and tooling to compare the API surfaces of various array libraries. ☆54 Updated 5 months ago
- ☆48 Updated 3 months ago
- Experiment of using Tangent to autodiff triton ☆71 Updated 9 months ago
- benchmarking some transformer deployments ☆26 Updated last year
- A stand-alone implementation of several NumPy dtype extensions used in machine learning. ☆208 Updated this week
- ☆98 Updated 4 months ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository … ☆107 Updated this week
- No-GIL Python environment featuring NVIDIA Deep Learning libraries. ☆20 Updated this week
- ☆17 Updated 2 weeks ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆35 Updated 3 months ago
- Automatically insert nvtx ranges to PyTorch models ☆17 Updated 3 years ago
- ☆40 Updated 4 months ago
- Personal solutions to the Triton Puzzles ☆15 Updated 3 months ago
- GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU ☆18 Updated 2 weeks ago
- Worked example of the process from Python source to CUDA kernel execution with Numba ☆36 Updated last month
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ☆98 Updated last month
- Experimental scripts for researching data adaptive learning rate scheduling. ☆23 Updated last year
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors` ☆42 Updated 5 months ago
- TORCH_LOGS parser for PT2 ☆21 Updated 3 weeks ago
- TorchFix - a linter for PyTorch-using code with autofix support ☆98 Updated last month