dthuerck / culip
Code for the culip ("CUda for Linear and Integer Programming") project, containing GPU primitives for linear algebra, linear optimization and (someday) integer optimization.
☆18Updated 6 years ago
Alternatives and similar repositories for culip:
Users that are interested in culip are comparing it to the libraries listed below
- Some microbenchmarks and design docs before commencement☆12Updated 4 years ago
- ☆58Updated this week
- FlexAttention w/ FlashAttention3 Support☆26Updated 5 months ago
- MPI Code Generation through Domain-Specific Language Models☆13Updated 4 months ago
- Standalone commandline CLI tool for compiling Triton kernels☆17Updated 6 months ago
- Loop Nest - Linear algebra compiler and code generator.☆22Updated 2 years ago
- A tracing JIT compiler for PyTorch☆13Updated 3 years ago
- JAX implementation of "Fine-Tuning Language Models with Just Forward Passes"☆19Updated last year
- ☆9Updated 3 weeks ago
- [WIP] Better (FP8) attention for Hopper☆26Updated last month
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- benchmarking some transformer deployments☆26Updated 2 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Updated last year
- ☆21Updated 3 months ago
- Implementation of Spectral State Space Models☆16Updated last year
- Solver for Unconstrained Binary Quadratic Optimization (UBQO, BQO, QUBO) and Max 2-SAT, based on semidefinite relaxation with constraint …☆15Updated last year
- Benchmarking PyTorch 2.0 different models☆21Updated 2 years ago
- Triton kernels for Flux☆20Updated 2 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 5 months ago
- Sparse linear Boolean algebra for Nvidia Cuda☆24Updated last week
- Causal Analysis of Agent Behavior for AI Safety☆17Updated last year
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆53Updated 3 weeks ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic…☆11Updated 4 years ago
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆28Updated 4 years ago
- ☆18Updated 11 months ago
- ☆13Updated 2 years ago
- LaTeX source code for the slides