Code for the paper PermuteFormer
☆42Oct 10, 2021Updated 4 years ago
Alternatives and similar repositories for PermuteFormer
Users who are interested in PermuteFormer are comparing it to the libraries listed below.
- ☆10Sep 13, 2021Updated 4 years ago
- Fast model deployment on AWS Lambda☆15Feb 25, 2024Updated 2 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- A Pytree Module system for Deep Learning in JAX☆212Feb 26, 2023Updated 3 years ago
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆57Jan 5, 2023Updated 3 years ago
- ☆13Jun 16, 2021Updated 5 years ago
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆26May 1, 2022Updated 4 years ago
- ☆18Nov 25, 2022Updated 3 years ago
- A simple template for TensorFlow's highly efficient CudnnLSTM module☆11Jun 8, 2018Updated 8 years ago
- Assignment codes for CS736 Algorithms for Medical Image Processing.☆10Aug 10, 2016Updated 9 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆84Oct 30, 2021Updated 4 years ago
- ☆14Oct 28, 2023Updated 2 years ago
- High performance pytorch modules☆18Jan 14, 2023Updated 3 years ago
- The code repository for Discovering Conflicting Groups in Signed Networks (NeurIPS 2020)☆15Aug 24, 2021Updated 4 years ago
- ACL 2022 (Findings): A Sentence is Worth 128 Pseudo Tokens: A Semantic-Aware Contrastive Learning Framework for Sentence Embeddings☆18Mar 23, 2022Updated 4 years ago
- pytorch implementation of trDesign☆45Mar 19, 2021Updated 5 years ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆1,216Jun 8, 2026Updated 3 weeks ago
- Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020)☆43Oct 20, 2022Updated 3 years ago
- ☆13Sep 8, 2020Updated 5 years ago
- ☆14Jul 2, 2022Updated 4 years ago
- Awesome video-based self-supervised learning methods from recent years☆10Nov 26, 2020Updated 5 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆81Dec 4, 2022Updated 3 years ago
- [DEPRECATED] Piano Transformer model trained on 2.6GB of MIDI piano music☆13Oct 10, 2022Updated 3 years ago
- Fast Discounted Cumulative Sums in PyTorch☆98Aug 28, 2021Updated 4 years ago
- Code associated with our paper "Learning Group Structure and Disentangled Representations of Dynamical Environments"☆15Dec 8, 2022Updated 3 years ago
- PyTorch Language Modeling Toolkit for Fast Weight Programmers☆22Jun 11, 2025Updated last year
- [Algorithm] Computing image similarity from image color☆11Sep 16, 2020Updated 5 years ago
- ☆31Jun 28, 2022Updated 4 years ago
- Chinese version of reformer-pytorch, a simple and efficient generative model with results similar to GPT2☆16Jun 12, 2023Updated 3 years ago
- Implementation of Linformer for Pytorch☆307Jan 5, 2024Updated 2 years ago
- Official PyTorch implementation of 'RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces'.☆31Nov 6, 2020Updated 5 years ago
- ☆26Sep 15, 2022Updated 3 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Jun 9, 2021Updated 5 years ago
- A convolution-free, transformer-only version of the CycleGAN framework☆33Feb 12, 2022Updated 4 years ago
- Code for 'Inference Suboptimality in Variational Autoencoders'☆10May 22, 2020Updated 6 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation"☆11Jul 13, 2017Updated 8 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆115Jun 10, 2021Updated 5 years ago
- ☆81Jan 21, 2022Updated 4 years ago