wiegerw / nerva
C++ and Python libraries for neural networks.
☆13 Updated 4 months ago
Alternatives and similar repositories for nerva:
Users who are interested in nerva compare it to the libraries listed below.
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks" ☆60 Updated 2 years ago
- ☆13 Updated last year
- ☆52 Updated 4 months ago
- ☆21 Updated 2 years ago
- A centralized place for deep thinking code and experiments ☆81 Updated last year
- Prototype routines for GPU quantization written using PyTorch. ☆19 Updated last week
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency ☆25 Updated 6 months ago
- Latest Weight Averaging (NeurIPS HITY 2022) ☆28 Updated last year
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)" ☆18 Updated last year
- ☆19 Updated 3 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025] ☆18 Updated last month
- The Energy Transformer block, in JAX ☆56 Updated last year
- Make triton easier ☆44 Updated 8 months ago
- JAX implementation of "Fine-Tuning Language Models with Just Forward Passes" ☆19 Updated last year
- 👑 PyTorch code for the Nero optimiser. ☆20 Updated 2 years ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522) ☆59 Updated 3 years ago
- A case study of efficient training of large language models using commodity hardware. ☆68 Updated 2 years ago
- Experiment of using Tangent to autodiff triton ☆75 Updated last year
- Repository of machine learning benchmarks ☆34 Updated this week
- Proof-of-concept of global switching between numpy/jax/pytorch in a library. ☆18 Updated 8 months ago
- ☆25 Updated last year
- ☆22 Updated 6 years ago
- Sparsity support for PyTorch ☆33 Updated last week
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization) ☆12 Updated last year
- [ICLR 2024] Dynamic Sparse Training with Structured Sparsity ☆17 Updated 10 months ago
- ☆29 Updated 4 months ago
- ☆49 Updated last year
- Meta-learning inductive biases in the form of useful conserved quantities. ☆37 Updated 2 years ago
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020) ☆48 Updated 3 years ago
- Amos optimizer with JEstimator lib. ☆81 Updated 9 months ago