Official codebase for Pretrained Transformers as Universal Computation Engines.
☆246Jan 14, 2022Updated 4 years ago
Alternatives and similar repositories for universal-computation
Users that are interested in universal-computation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"☆73Mar 6, 2024Updated 2 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Dec 16, 2022Updated 3 years ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆1,200Aug 22, 2023Updated 2 years ago
- ☆21Mar 15, 2023Updated 3 years ago
- A JAX nn library☆21Sep 9, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A set of 13 diverse machine-learning tasks that require memory to solve.☆225Aug 12, 2021Updated 4 years ago
- [CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations☆565Aug 22, 2025Updated 7 months ago
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks.☆133Nov 11, 2024Updated last year
- Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"☆23Oct 26, 2021Updated 4 years ago
- ☆57Aug 14, 2020Updated 5 years ago
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.☆876Oct 14, 2024Updated last year
- Code for T-MARS data filtering☆35Aug 23, 2023Updated 2 years ago
- AAAI 2022 Paper: Bet even Beth Harmon couldn't learn chess like that :)☆38Mar 3, 2021Updated 5 years ago
- A GPT, made only of MLPs, in Jax☆59Jun 23, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [NeurIPS'19] Deep Equilibrium Models☆797Jul 4, 2022Updated 3 years ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆99Dec 31, 2021Updated 4 years ago
- Codebase for Learning Invariances in Neural Networks☆96Oct 7, 2022Updated 3 years ago
- Fine-grained ImageNet annotations☆30May 25, 2020Updated 5 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch☆25Jan 6, 2021Updated 5 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆5,624Feb 17, 2024Updated 2 years ago
- ARC Community Project☆22Aug 2, 2024Updated last year
- Implementation of Bidirectional Recurrent Independent Mechanisms (Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural …☆28Nov 11, 2020Updated 5 years ago
- Repo for code for the NIPS paper entitled "An Architecture for Deep, Hierarchical Generative Models"☆14Oct 27, 2016Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Nov 29, 2021Updated 4 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆106Jul 18, 2022Updated 3 years ago
- [CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"☆408Nov 10, 2023Updated 2 years ago
- Open-AI's DALL-E for large scale training in mesh-tensorflow.☆431Feb 12, 2022Updated 4 years ago
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Feb 6, 2023Updated 3 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆113Jun 10, 2021Updated 4 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,297Mar 3, 2024Updated 2 years ago
- Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute☆1,530Nov 18, 2020Updated 5 years ago
- ☆30Jan 17, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆51Jun 11, 2025Updated 10 months ago
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 4 years ago
- Understanding Training Dynamics of Deep ReLU Networks☆306Oct 19, 2025Updated 5 months ago
- Code for "Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning"☆417Mar 21, 2024Updated 2 years ago
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…☆19Nov 19, 2024Updated last year
- Implementation of Feedback Transformer in Pytorch☆108Mar 2, 2021Updated 5 years ago
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆730Aug 8, 2023Updated 2 years ago