stadlmax / pyFIFOtax
Simple Tax Reporting Tool for share transactions on foreign exchanges
☆19Updated 2 months ago
Alternatives and similar repositories for pyFIFOtax:
Users that are interested in pyFIFOtax are comparing it to the libraries listed below
- CUDA kernel author's tools☆111Updated 3 years ago
- Deep Learning Primitives and Mini-Framework for OpenCL☆195Updated 8 months ago
- ☆58Updated 8 months ago
- Kernel Tuner☆331Updated this week
- CUDASW++4.0: Ultra-fast GPU-based Smith-Waterman Protein Sequence Database Search☆36Updated 5 months ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆328Updated this week
- GitHub Action to install CUDA☆173Updated 3 weeks ago
- Online CUDA Occupancy Calculator☆75Updated 3 years ago
- CUDA Kernel Benchmarking Library☆631Updated this week
- Training material for Nsight developer tools☆157Updated 9 months ago
- ☆537Updated last week
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 3 months ago
- Thin, unified, C++-flavored wrappers for the CUDA APIs☆837Updated last week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆533Updated last month
- A library to benchmark CUDA code, similar to google benchmark.☆28Updated 4 years ago
- Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.☆347Updated 3 years ago
- Convert nvprof profiles into about:tracing compatible JSON files☆69Updated 4 years ago
- Header-only library of GPU-accelerated, concurrent data structures.☆10Updated 2 weeks ago
- A plugin for Jupyter Notebook to run CUDA C/C++ code☆227Updated 7 months ago
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal - all it takes to sum a lot of numbers fast!☆96Updated this week
- Automatically insert nvtx ranges to PyTorch models☆17Updated 4 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆150Updated last year
- GPUOCelot: A dynamic compilation framework for PTX☆287Updated last year
- Full-speed Array of Structures access☆169Updated 2 years ago
- An extension library of WMMA API (Tensor Core API)☆96Updated 9 months ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆324Updated this week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆130Updated last year
- HIPIFY: Convert CUDA to Portable C++ Code☆574Updated this week
- Experimental OpenCL SPIR-V to OpenCL C translator☆26Updated 3 months ago