PAIR-code / tiny-transformers
☆20 Updated 2 weeks ago
Alternatives and similar repositories for tiny-transformers
Users who are interested in tiny-transformers are comparing it to the libraries listed below.
- Implementation of Metaformer, but in an autoregressive manner ☆25 Updated 3 years ago
- ☆16 Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation" ☆21 Updated 2 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆24 Updated this week
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator ☆31 Updated last year
- Implementation of a holodeck, written in Pytorch ☆18 Updated last year
- ☆18 Updated last year
- A JAX nn library ☆21 Updated 4 months ago
- ☆21 Updated last month
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols ☆16 Updated 3 years ago
- Shows how to do parameter ensembling using differential evolution. ☆10 Updated 3 years ago
- TaskMet Task-driven Metric Learning for Model Learning ☆19 Updated last year
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️ ☆54 Updated 2 years ago
- ☆11 Updated last year
- Minimum Description Length probing for neural network representations ☆18 Updated 5 months ago
- 📰 Computing the information content of trained neural networks ☆21 Updated 3 years ago
- A framework for implementing equivariant DL ☆10 Updated 4 years ago
- ☆23 Updated 6 months ago
- Generative cellular automaton-like learning environments for RL. ☆19 Updated 4 months ago
- AdaCat ☆49 Updated 2 years ago
- Automatically generate simple meta-learning tasks from a very large space ☆15 Updated last year
- JAX implementation of Learning to learn by gradient descent by gradient descent ☆27 Updated 8 months ago
- Alpha Zero equipped with Transformer with various novel techniques for speedup in tree search ☆27 Updated 6 years ago
- Implementation of Spectral State Space Models ☆16 Updated last year
- ☆32 Updated last year
- ☆26 Updated 2 years ago
- Causal Analysis of Agent Behavior for AI Safety ☆18 Updated 2 years ago
- The code for "Multi-CLS BERT: An Efficient Alternative to Traditional Ensembling" ☆11 Updated 2 years ago
- Training hybrid models for dummies. ☆23 Updated 5 months ago
- Documentation for dynamic machine learning systems. ☆29 Updated 9 months ago