dpuyda / scheduling
A simple and fast library for running async tasks and executing task graphs.
☆41Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for scheduling
- Parallel Tasking Library (PTL) - Lightweight C++11 multithreading tasking system featuring thread-pool, task-groups, and lock-free task q…☆43Updated 3 months ago
- Experimental ranges for CUDA☆25Updated 5 years ago
- The Farm-SVE package provides a header that implements the ARM C language extensions (ACLE) for the ARM Scalable Vector Extension (SVE) i…☆13Updated 9 months ago
- Thrust, CUB, TBB, AVX2, CUDA, OpenCL, OpenMP, SYCL - all it takes to sum a lot of numbers fast!☆73Updated 6 months ago
- Task graph-based asynchronous programming system using C++ coroutine☆84Updated 8 months ago
- A fast implementation of log() and exp()☆49Updated last year
- Profiling Taskflow Programs through Visualization☆47Updated last year
- This is part of the zeus library, just for sharing and fun.☆33Updated last year
- a CUDA implementation of a priority queue☆81Updated 4 years ago
- C99/C++ header-only library for division via fixed-point multiplication by inverse☆48Updated 6 months ago
- A header only library implementing common mathematical functions using SIMD intrinsics☆92Updated 2 weeks ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.☆70Updated 9 years ago
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line☆18Updated last week
- ☆28Updated this week
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations☆45Updated 4 years ago
- C++ "borrowing" smart pointer.☆11Updated 2 years ago
- A Nonlinear Least Squares Minimizer☆34Updated 12 years ago
- C++ dataflow parallel processing framework☆23Updated 3 years ago
- ☆17Updated 7 years ago
- Abstractions of memory, allocator, vector, tuple, shared_ptr, unique_ptr, bitset, variant and string working on both CPU and GPU☆30Updated 2 weeks ago
- SIMD-enabled descriptive statistics (mean, variance, covariance, correlation)☆18Updated 2 months ago
- Fast, shared, upgradeable, non-recursive and non-fair mutex☆29Updated 6 years ago
- Looking into the performance of heaps, starting with the Min-Max Heap☆63Updated 3 years ago
- Polymorphic memory resource for real-time applications.☆64Updated last year
- Concurrent CPU-GPU Programming using Task Models☆100Updated 4 years ago
- A minimalistic header only C++11 Neural Network library based on Eigen::Tensor☆20Updated 6 years ago
- Portable 128-bit SIMD intrinsics☆55Updated last year
- Clover: Quantized 4-bit Linear Algebra Library☆110Updated 6 years ago
- Scheduling examples using C++20 coroutines☆22Updated last year
- tokenizer and parser for circle projects☆11Updated 5 years ago