wiegerw / nervaLinks
C++ and Python libraries for neural networks.
☆15Updated 3 weeks ago
Alternatives and similar repositories for nerva
Users that are interested in nerva are comparing it to the libraries listed below
Sorting:
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆19Updated 10 months ago
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency☆25Updated 10 months ago
- 🧮 Algebraic Positional Encodings.☆13Updated 4 months ago
- ☆13Updated 3 weeks ago
- [NAACL 2025] Official Implementation of "HMT: Hierarchical Memory Transformer for Long Context Language Processing"☆71Updated this week
- Repo for solving arc problems with an Neural Cellular Automata☆15Updated 2 weeks ago
- Make triton easier☆47Updated 11 months ago
- ☆21Updated 8 months ago
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆94Updated 6 months ago
- LeanAgent is a novel lifelong learning framework for formal theorem proving that continuously generalizes to and improves on ever-expandi…☆26Updated last month
- ☆32Updated 8 months ago
- ☆13Updated this week
- 👑 Pytorch code for the Nero optimiser.☆20Updated 2 years ago
- JAX/Flax implementation of the Hyena Hierarchy☆34Updated 2 years ago
- train with kittens!☆57Updated 7 months ago
- Minimum Description Length probing for neural network representations☆19Updated 4 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆60Updated 10 months ago
- ☆53Updated 8 months ago
- Official Implementation of "CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks"☆19Updated this week
- Experiment of using Tangent to autodiff triton☆79Updated last year
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆59Updated 3 years ago
- A system for automating selection and optimization of pre-trained models from the TAO Model Zoo☆25Updated 11 months ago
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.☆62Updated 4 months ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Updated 11 months ago
- FastFeedForward Networks☆20Updated last year
- Official repository for the paper "Neural Differential Equations for Learning to Program Neural Nets Through Continuous Learning Rules" (…☆22Updated 2 years ago
- FlexAttention w/ FlashAttention3 Support☆26Updated 8 months ago
- Implementation of Spectral State Space Models☆16Updated last year
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- Implementing RASP transformer programming language https://arxiv.org/pdf/2106.06981.pdf.☆54Updated 3 years ago