PAIR-code / tiny-transformers
☆20 · Updated 2 weeks ago
Alternatives and similar repositories for tiny-transformers
Users who are interested in tiny-transformers compare it to the libraries listed below.
- Implementation of Metaformer, but in an autoregressive manner ☆24 · Updated 2 years ago
- Computing the information content of trained neural networks ☆21 · Updated 3 years ago
- ☆16 · Updated last year
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols ☆15 · Updated 3 years ago
- Understanding RL vision Distill article ☆23 · Updated 2 years ago
- Implementation of a holodeck, written in Pytorch ☆17 · Updated last year
- ☆18 · Updated last year
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient" ☆28 · Updated 4 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator ☆31 · Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆50 · Updated 3 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation" ☆21 · Updated last year
- Framework for building algorithms based on FractalAI theory ☆19 · Updated 4 years ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th… ☆12 · Updated 3 years ago
- Training hybrid models for dummies. ☆21 · Updated 4 months ago
- ☆31 · Updated last year
- Low-Rank Adaptation of Large Language Models clean implementation ☆8 · Updated last year
- Automatically generate simple meta-learning tasks from a very large space ☆15 · Updated last year
- ☆11 · Updated 11 months ago
- Source-to-Source Debuggable Derivatives in Pure Python ☆15 · Updated last year
- AdaCat ☆49 · Updated 2 years ago
- A JAX nn library ☆21 · Updated 2 months ago
- Repo to reproduce the First-Explore paper results ☆37 · Updated 4 months ago
- A framework for implementing equivariant DL ☆10 · Updated 3 years ago
- Shows how to do parameter ensembling using differential evolution. ☆10 · Updated 3 years ago
- Explorations into the proposed Streaming Deep Reinforcement Learning, from University of Alberta ☆19 · Updated 6 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 · Updated last year
- Personal solutions to the Triton Puzzles ☆18 · Updated 9 months ago
- Directed masked autoencoders ☆14 · Updated 2 years ago
- ☆15 · Updated 2 years ago
- Scripts for building and deploying ConceptNet, using Packer and Puppet ☆10 · Updated 4 years ago