PAIR-code / tiny-transformersLinks
☆20Updated this week
Alternatives and similar repositories for tiny-transformers
Users that are interested in tiny-transformers are comparing it to the libraries listed below
Sorting:
- Recursive Leasting Squares (RLS) with Neural Network for fast learning☆54Updated last year
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆16Updated 3 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- Shows how to do parameter ensembling using differential evolution.☆10Updated 3 years ago
- A JAX nn library☆21Updated 4 months ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆50Updated 3 years ago
- ☆16Updated last year
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆31Updated last year
- Minimum Description Length probing for neural network representations☆18Updated 5 months ago
- ☆18Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated 2 weeks ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆49Updated 3 years ago
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆29Updated 4 years ago
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆54Updated 2 years ago
- AdaCat☆49Updated 2 years ago
- Implementation of Spectral State Space Models☆16Updated last year
- Implementation of a holodeck, written in Pytorch☆18Updated last year
- Easily serialize dataclasses to and from tensors (PyTorch, NumPy)