A PyTorch implementation of the Transformer model from "Attention Is All You Need".
☆60Jul 13, 2019Updated 6 years ago
Alternatives and similar repositories for pytorch-transformer
Users that are interested in pytorch-transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆35Aug 20, 2020Updated 5 years ago
- Implementation of "Attention is All You Need" paper☆33Jul 25, 2024Updated last year
- ☆11Jan 2, 2022Updated 4 years ago
- Keras implementation of the Information Dropout (arXiv:1611.01353) paper☆15Dec 31, 2016Updated 9 years ago
- MARNNs Can Learn Generalized Dyck Languages☆12Nov 11, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Batch Calculator for Zeolite synthesis☆11Dec 14, 2024Updated last year
- Belief in the Machine: Investigating Epistemological Blind Spots of Language Models☆32Apr 19, 2025Updated 11 months ago
- Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding☆20Nov 16, 2022Updated 3 years ago
- Scientific Computing from Scratch☆11Oct 23, 2025Updated 5 months ago
- CRFs based Chinese word segmentor☆21Oct 8, 2014Updated 11 years ago
- Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board.☆21Aug 12, 2022Updated 3 years ago
- Code for our EMNLP '22 paper "Fixing Model Bugs with Natural Language Patches"☆19Dec 7, 2022Updated 3 years ago
- Code release of our NeurIPS 18 paper "A flexible model for training action localization with varying levels of supervision"☆16Dec 28, 2018Updated 7 years ago
- ☆18Mar 25, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Pytorch implementation of RFCN used as baseline for Imagenet VID+DET in https://arxiv.org/abs/1710.03958.☆34Nov 3, 2018Updated 7 years ago
- materials for my workshop "Latest Deep Learning Models for NLP" @ the European Open Data Science Conference 2019☆11Feb 3, 2020Updated 6 years ago
- Audio-conditioned video texture generation☆24Sep 16, 2022Updated 3 years ago
- A Higher-order HMM with EM algo.☆16May 4, 2022Updated 3 years ago
- ☆16Jul 6, 2023Updated 2 years ago
- ☆13Dec 12, 2022Updated 3 years ago
- A CNN based Depth, Optical Flow, Flow Uncertainty and Camera Pose Prediction pipeline☆13Mar 25, 2019Updated 7 years ago
- Conditional Random Fields implemented as Lasagne layer☆10Jul 22, 2016Updated 9 years ago
- Differentiable MPC in Chainer, developed as part of PFN summer internship 2019.☆15Aug 23, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- DIBS is an implementation of a basic controlled digital lending (CDL) system using IIIF to make scanned books available for time-limited …☆27Dec 15, 2025Updated 3 months ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Get personalised programming job postings right to your telegram with ease☆18Jul 5, 2021Updated 4 years ago
- Tetris: Implement my solution-finder in C++☆17Apr 7, 2025Updated last year
- Easy to use benchmarks for linear algebra frameworks☆24Jun 5, 2020Updated 5 years ago
- A curated list of resources about NLP☆10Apr 16, 2023Updated 2 years ago
- The official code for ICCV 2023 paper "Reconstructing Groups of People with Hypergraph Relational Reasoning"☆12Jul 4, 2025Updated 9 months ago
- This repository contains code for the paper "Are Pretrained Language Models Symbolic Reasoners over Knowledge?"☆13Mar 23, 2021Updated 5 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Code for papers "A Surprisingly Robust Trick for Winograd Schema Challenge" and "WikiCREM: A Large Unsupervised Corpus for Coreference Re…☆71Oct 4, 2022Updated 3 years ago
- ☆20Jan 12, 2026Updated 2 months ago
- Pipelined quality estimation.☆51Aug 13, 2019Updated 6 years ago
- ☆13Mar 2, 2025Updated last year
- The implementation of the model proposed in the Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing paper☆60Jan 16, 2019Updated 7 years ago
- Implementation of Poincare Embedding in PyTorch☆13Jul 27, 2017Updated 8 years ago
- Repository for MoleGuLAR: Molecule generation using Reinforcement Learning and Alternating Rewards☆25May 18, 2024Updated last year