NVIDIA/free-threaded-python

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NVIDIA/free-threaded-python)

NVIDIA / free-threaded-python

No-GIL Python environment featuring NVIDIA Deep Learning libraries.

☆71

Alternatives and similar repositories for free-threaded-python

Users that are interested in free-threaded-python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GiulioRomualdi / rsl-rl-mujoco-example
View on GitHub
☆15Apr 7, 2025Updated last year
TiledTensor / TiledBench
View on GitHub
Benchmark tests supporting the TiledCUDA library.
☆19Nov 19, 2024Updated last year
WaveSpeedAI / QuantumAttention
View on GitHub
[WIP] Better (FP8) attention for Hopper
☆33Feb 24, 2025Updated last year
rapidsai / dependency-file-generator
View on GitHub
☆16Updated this week
piyueh / TorchSWE
View on GitHub
A GPU shallow-water equation (SWE) solver
☆15May 5, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
hunger / octoconda
View on GitHub
Octoconda runner
☆18Updated this week
yester31 / Cutlass_EX
View on GitHub
study of cutlass
☆22Nov 10, 2024Updated last year
chengzeyi / piflux
View on GitHub
(WIP) Parallel inference for black-forest-labs' FLUX model.
☆19Nov 18, 2024Updated last year
junhahyung / MagiCapture
View on GitHub
☆11Feb 26, 2024Updated 2 years ago
NVIDIA / compute-sanitizer-samples
View on GitHub
Samples demonstrating how to use the Compute Sanitizer Tools and Public API
☆99Nov 6, 2023Updated 2 years ago
YJMSTR / flash-linear-attention
View on GitHub
FLA but cuTile
☆27Apr 17, 2026Updated 3 months ago
nzw0301 / pb-contrastive
View on GitHub
#UAI2020 Codes for PAC-Bayesian Contrastive Unsupervised Representation Learning
☆14May 23, 2022Updated 4 years ago
eth-cscs / Tiled-MM
View on GitHub
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
☆33Apr 2, 2025Updated last year
nicolaswilde / amx-gemm-handwritten
View on GitHub
Handwritten GEMM using Intel AMX (Advanced Matrix Extension)
☆17Jan 11, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
mandli / tsunami-models
View on GitHub
☆12Nov 15, 2024Updated last year
pmmilani / tbnns
View on GitHub
Tensor Basis Neural Network for Scalar Mixing
☆10Mar 24, 2023Updated 3 years ago
sile / hone
View on GitHub
A shell-friendly hyperparameter search tool inspired by Optuna
☆18Dec 17, 2024Updated last year
OPM / pyopmspe11
View on GitHub
A Python framework using OPM Flow for the SPE11 benchmark project
☆21Jun 12, 2026Updated last month
Yifei-Zuo / FlashLLA
View on GitHub
Official repository Flash Local Linear Attention
☆37May 28, 2026Updated last month
hvy / chainer-param-monitor
View on GitHub
Monitor parameter and gradient statistics during neural network training with Chainer
☆13Jan 24, 2017Updated 9 years ago
wahibium / KFF
View on GitHub
Scalable GPU Kernel Fission/Fusion Transformation for Memory-Bound Kernels
☆14Aug 26, 2015Updated 10 years ago
pydata / conf_site
View on GitHub
☆16Jun 10, 2025Updated last year
rwightman / imagenet-12k
View on GitHub
ImageNet-12k subset of ImageNet-21k (fall11)
☆23Jun 13, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
TiledTensor / TiledLower
View on GitHub
TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.
☆13Nov 23, 2024Updated last year
shunk031 / human-attention-map-for-text-classification
View on GitHub
Reimplementation of the paper `Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? (ACL2…
☆17Jul 10, 2020Updated 6 years ago
cherichy / tilecute
View on GitHub
☆32Jul 2, 2025Updated last year
conbench / conbench-tmp
View on GitHub
General purpose, language-agnostic Continuous Benchmarking (CB) framework
☆35Apr 15, 2020Updated 6 years ago
GVProf / GVProf
View on GitHub
GVProf: A Value Profiler for GPU-based Clusters
☆54Mar 24, 2024Updated 2 years ago
Dao-AILab / gemm-cublas
View on GitHub
☆22May 5, 2025Updated last year
ademeure / DeeperGEMM
View on GitHub
DeeperGEMM: crazy optimized version
☆86May 5, 2025Updated last year
osqp / qdldl-python
View on GitHub
Python interface to the QDLDL (https://github.com/osqp/qdldl) free LDL factorization routine for quasi-definite linear systems
☆18Apr 6, 2026Updated 3 months ago
jrbourbeau / dask-optuna
View on GitHub
Scale Optuna with Dask
☆37Oct 1, 2020Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
toyaix / triton-runner
View on GitHub
Multi-Level Triton Runner supporting Python, IR, PTX, AMDGCN, cubin and hasco.
☆98May 8, 2026Updated 2 months ago
NVIDIA / HMM_sample_code
View on GitHub
CUDA 12.2 HMM demos
☆21Jul 26, 2024Updated last year
Jokeren / GPA
View on GitHub
GPU Performance Advisor
☆66Jul 25, 2022Updated 3 years ago
hazan-lab / flash-stu
View on GitHub
PyTorch implementation of the Flash Spectral Transform Unit.
☆22Sep 19, 2024Updated last year
Mellanox / bluefield-linux
View on GitHub
Linux kernel to support Mellanox BlueField SoCs
☆14Nov 13, 2019Updated 6 years ago
arpitp / ssd-controller
View on GitHub
Open Source SSD Controller. NVMe and Lightstor variants
☆17May 21, 2014Updated 12 years ago
HanGuo97 / hilt
View on GitHub
☆40Dec 14, 2025Updated 7 months ago