microsoft / DGT
Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent
☆14 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for DGT
- Factorized Neural Layers ☆27 · Updated last year
- Implementation of a TensorFlow XLA rematerialization pass ☆15 · Updated 4 years ago
- Code for the paper "Faster Neural Network Training with Approximate Tensor Operations" ☆10 · Updated 3 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning. ☆16 · Updated 3 years ago
- Standalone command-line tool for compiling Triton kernels ☆15 · Updated 2 months ago
- Make Triton easier ☆41 · Updated 5 months ago
- Benchmarking some transformer deployments ☆26 · Updated last year
- Code for BlockSwap (ICLR 2020). ☆33 · Updated 3 years ago
- Nonparametric Score Estimators, ICML 2020 ☆36 · Updated 3 years ago
- Open-source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ) ☆49 · Updated 6 years ago
- Spartan is an algorithm for training sparse neural network models. This repository accompanies the paper "Spartan Differentiable Sparsity… ☆24 · Updated 2 years ago
- An implementation of various tensor-based decompositions for NN & RNN parameters ☆18 · Updated 6 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf) ☆31 · Updated 6 years ago
- Using FlexAttention to compute attention with different masking patterns ☆40 · Updated 2 months ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo… ☆64 · Updated this week
- Structured matrices for compressing neural networks ☆67 · Updated last year
- Official code release for the paper "Coder Reviewer Reranking for Code Generation". ☆42 · Updated last year
- A Learnable LSH Framework for Efficient NN Training ☆30 · Updated 3 years ago
- Blog post ☆16 · Updated 9 months ago
- This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses. ☆16 · Updated last year
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆35 · Updated 4 months ago
- A minimal implementation of a VAE with BinConcrete (relaxed Bernoulli) latent distribution in TensorFlow. ☆21 · Updated 4 years ago