A simple implementation of a deep linear Pytorch module
☆21Oct 16, 2020Updated 5 years ago
Alternatives and similar repositories for deep-linear-network
Users that are interested in deep-linear-network are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple Transformer where the softmax has been replaced with normalization☆20Sep 11, 2020Updated 5 years ago
- A GPT, made only of MLPs, in Jax☆59Jun 23, 2021Updated 4 years ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆57May 17, 2024Updated last year
- Implementation of Transframer, Deepmind's U-net + Transformer architecture for up to 30 seconds video generation, in Pytorch☆72Aug 23, 2022Updated 3 years ago
- Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch☆54Mar 30, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆46May 23, 2023Updated 2 years ago
- Fast and memory-efficient exact attention☆20Jul 22, 2024Updated last year
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆34Oct 30, 2020Updated 5 years ago
- Published by Packt Publishing☆19Jan 15, 2021Updated 5 years ago
- Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc…☆309Dec 27, 2021Updated 4 years ago
- Distance Guided Channel Weighting for Semantic Sgementation (https://arxiv.org/abs/2004.12679)☆14Nov 24, 2020Updated 5 years ago
- A Pytorch implementation of Global Self-Attention Network, a fully-attention backbone for vision tasks☆94Nov 21, 2020Updated 5 years ago
- Implementation of Geometric Vector Perceptron, a simple circuit for 3d rotation equivariance for learning over large biomolecules, in Pyt…☆77Jun 8, 2021Updated 4 years ago
- Implementation of Hierarchical Transformer Memory (HTM) for Pytorch☆76Sep 15, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of Nvidia's NeuralPlexer, for end-to-end differentiable design of functional small-molecules and ligand-binding proteins, …☆52Nov 20, 2023Updated 2 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆76Dec 4, 2022Updated 3 years ago
- To be a next-generation DL-based phenotype prediction from genome mutations.☆19May 17, 2021Updated 4 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- CVE-2016-4657 web-kit vulnerability for ios 9.3, nintendo switch browser vulnerability☆10Nov 11, 2018Updated 7 years ago
- Implementation of Linformer for Pytorch☆306Jan 5, 2024Updated 2 years ago
- Toy genetic algorithm in Pytorch☆56Apr 21, 2026Updated 2 weeks ago
- EMNLP 2020: Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots☆12Dec 15, 2020Updated 5 years ago
- Contrastive Language-Audio Pretraining☆15May 18, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implementation of Multistream Transformers in Pytorch☆54Jul 31, 2021Updated 4 years ago
- A barely barebone NumPy implementation of Hierarchical Temporal Memory.☆11Mar 26, 2023Updated 3 years ago
- ICML'20: SIGUA: Forgetting May Make Learning with Noisy Labels More Robust☆17Dec 14, 2020Updated 5 years ago
- Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, …☆39Aug 3, 2021Updated 4 years ago
- Implementation of Feedback Transformer in Pytorch☆108Mar 2, 2021Updated 5 years ago
- ☆21May 3, 2020Updated 6 years ago
- Source code for ScaleGrad☆19Dec 28, 2021Updated 4 years ago
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆120Aug 4, 2021Updated 4 years ago
- models for MoreMNAS☆31Jul 6, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆33Apr 12, 2021Updated 5 years ago
- PhD thesis (updating) of Jiatao Gu from HKU☆19Aug 10, 2018Updated 7 years ago
- Benchmark Generator for Global Routing☆13Jul 18, 2019Updated 6 years ago
- Causal Effect Inference for Structured Treatments (SIN) (NeurIPS 2021)☆42Apr 26, 2022Updated 4 years ago
- Modern Deep Network Toolkits for Tensorflow-Keras. This is a extension for newest tensorflow 1.x.☆13Aug 30, 2020Updated 5 years ago
- Implementation of ICLR 2020 paper "Revisiting Self-Training for Neural Sequence Generation"☆46Jun 30, 2022Updated 3 years ago
- PyTorch Language Modeling Toolkit for Fast Weight Programmers☆21Jun 11, 2025Updated 10 months ago