Implementation of "Attention is All You Need" paper
☆33Jul 25, 2024Updated last year
Alternatives and similar repositories for pytorch-transformer
Users that are interested in pytorch-transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Oct 20, 2023Updated 2 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆12Mar 18, 2023Updated 3 years ago
- Implementation of FixMatch in PyTorch and experimentations☆12Aug 9, 2020Updated 5 years ago
- minimal seq2seq of keras☆24Jun 17, 2017Updated 8 years ago
- PyTorch implementation of FAIR's paper "End-to-End Memory Network", NIPS 2015☆12Oct 19, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Jan 7, 2020Updated 6 years ago
- The code for the paper "Adversarial Decomposition of Text Representation", NAACL 2019☆29Dec 8, 2022Updated 3 years ago
- Egocentric Video Description based on Temporally-Linked Sequences☆11Jul 17, 2017Updated 8 years ago
- ☆13Aug 11, 2018Updated 7 years ago
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation☆14May 10, 2025Updated last year
- ☆10Dec 21, 2019Updated 6 years ago
- Span and Rule Models for Neural Constituent Parsing☆10Jun 11, 2018Updated 7 years ago
- Dependency Grammar Induction☆18Feb 11, 2019Updated 7 years ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆66Mar 21, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- character recognition, textline recognition☆10Aug 31, 2019Updated 6 years ago
- ☆10May 24, 2020Updated 6 years ago
- Distributed Bayesian Optimization☆23Jun 29, 2020Updated 5 years ago
- pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf☆43Mar 18, 2018Updated 8 years ago
- Literature mining for T cell relations☆23Aug 5, 2022Updated 3 years ago
- ☆19Oct 28, 2018Updated 7 years ago
- Radam+lookahead implemented by tensorflow☆11Oct 14, 2019Updated 6 years ago
- Render pyecharts as image via phantomjs☆13Oct 14, 2020Updated 5 years ago
- Deep Learning - Multi-Task Representation Learning using Shared Architecture for Deep Neural Networks☆19Apr 11, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Time Series forecasting with MLP, CNN, LSTM, CNN-LSTM for predicting future sales☆11Mar 27, 2023Updated 3 years ago
- wip, Pytorch implementation for ACL2017 paper "An unsupervised neural attention model for aspect extraction"☆12Apr 30, 2019Updated 7 years ago
- Code for H. Narasimhan, "Learning with Complex Loss Functions and Constraints", AISTATS 2018☆11Mar 21, 2018Updated 8 years ago
- X (weighted / probabilistic) Context-Free Grammars☆25Jan 30, 2024Updated 2 years ago
- Observe the dataset of images and targets in few shots☆11Sep 27, 2022Updated 3 years ago
- ☆14Apr 18, 2020Updated 6 years ago
- ☆17Nov 20, 2024Updated last year
- template for stowable dotfile dir☆13Feb 6, 2020Updated 6 years ago
- Dataset Pinocchio for paper "Towards Understanding Factual Knowledge of Large Language Models" accepted by ICLR 2024 (Spotlight)☆12Mar 13, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A seq2seq with attention dialogue/MT model implemented by TensorFlow.☆11Jul 17, 2018Updated 7 years ago
- The project mainly forcus on using recordIO to pack images and transforming learning for object classification.☆13Jul 22, 2019Updated 6 years ago
- Sequence-to-Sequence Generative Model for Sequential Recommender System☆18Mar 25, 2024Updated 2 years ago
- ☆15Oct 19, 2021Updated 4 years ago
- ☆20May 30, 2024Updated last year
- Generates a zip archive that is uploadable to arXiv.☆46Feb 19, 2020Updated 6 years ago
- Neptune - TensorBoard integration 🧩 Experiment tracking with advanced UI, collaborative features, and user access management.☆13Sep 4, 2025Updated 8 months ago