Official codebase for Pretrained Transformers as Universal Computation Engines.
☆246Jan 14, 2022Updated 4 years ago
Alternatives and similar repositories for universal-computation
Users that are interested in universal-computation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"☆73Mar 6, 2024Updated 2 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Dec 16, 2022Updated 3 years ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆1,196Aug 22, 2023Updated 2 years ago
- ☆21Mar 15, 2023Updated 3 years ago
- A JAX nn library☆21Sep 9, 2025Updated 6 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A set of 13 diverse machine-learning tasks that require memory to solve.☆225Aug 12, 2021Updated 4 years ago
- [CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations☆565Aug 22, 2025Updated 7 months ago
- Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"☆23Oct 26, 2021Updated 4 years ago
- ☆56Aug 14, 2020Updated 5 years ago
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.☆872Oct 14, 2024Updated last year
- Code for T-MARS data filtering☆35Aug 23, 2023Updated 2 years ago
- AAAI 2022 Paper: Bet even Beth Harmon couldn't learn chess like that :)☆38Mar 3, 2021Updated 5 years ago
- A GPT, made only of MLPs, in Jax☆59Jun 23, 2021Updated 4 years ago
- [NeurIPS'19] Deep Equilibrium Models☆795Jul 4, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Codebase for Learning Invariances in Neural Networks☆96Oct 7, 2022Updated 3 years ago
- [NeurIPS'19] [PyTorch] Adaptive Regularization in NN☆68Oct 13, 2019Updated 6 years ago
- Fine-grained ImageNet annotations☆30May 25, 2020Updated 5 years ago
- Replication materials for "Algorithmic decision making and the cost of fairness," by Corbett-Davies et al.☆11Jun 1, 2017Updated 8 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch☆25Jan 6, 2021Updated 5 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Nov 29, 2021Updated 4 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆106Jul 18, 2022Updated 3 years ago
- Open-AI's DALL-E for large scale training in mesh-tensorflow.☆431Feb 12, 2022Updated 4 years ago
- [CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"☆408Nov 10, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Feb 6, 2023Updated 3 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆114Jun 10, 2021Updated 4 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,294Mar 3, 2024Updated 2 years ago
- Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute☆1,532Nov 18, 2020Updated 5 years ago
- ☆30Jan 17, 2022Updated 4 years ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆51Jun 11, 2025Updated 9 months ago
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 4 years ago
- Understanding Training Dynamics of Deep ReLU Networks