A PyTorch implementation of "Attention Is All You Need"
☆43Oct 16, 2018Updated 7 years ago
Alternatives and similar repositories for Transformer-PyTorch
Users that are interested in Transformer-PyTorch are comparing it to the libraries listed below.
- An example of DyNet autobatching for the NIPS "how to code a paper" workshop☆12Dec 9, 2017Updated 8 years ago
- ☆13Aug 20, 2021Updated 4 years ago
- Parallel SGD, done locally and remote☆14May 19, 2016Updated 9 years ago
- A Python wrapper for the ROUGE summarization evaluation package☆14Aug 9, 2017Updated 8 years ago
- Implementation of the attention-sum reader using TensorFlow and Keras.☆11Aug 1, 2017Updated 8 years ago
- ☆25May 21, 2018Updated 7 years ago
- ☆14Dec 7, 2022Updated 3 years ago
- Pre-processing and training scripts for WMT 2017 ZH-EN translation task☆40Jun 7, 2020Updated 5 years ago
- Chinese whole-word masking (wwm) and n-gram masking based on jieba☆11Jul 25, 2019Updated 6 years ago
- Open solution to the Cdiscount’s Image Classification Challenge☆18Jun 22, 2022Updated 3 years ago
- Python wrapper for evaluating summarization quality by ROUGE package☆162May 25, 2020Updated 5 years ago
- Chinese generation tasks with the multilingual denoising pre-trained model MBart☆11May 27, 2021Updated 4 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- A simple Python 🇰🇷 library for handling Korean particles (josa) such as 은/는, 와/과, and 이/가. An open-source project published on PyPI.☆24Jul 6, 2021Updated 4 years ago
- PyTorch implementation of Transformer-based Neural Machine Translation☆78Dec 14, 2022Updated 3 years ago
- Semantic Parser with Execution☆28Sep 10, 2018Updated 7 years ago
- ☆17Oct 9, 2022Updated 3 years ago
- Tools for working with the S800 corpus☆12Sep 17, 2020Updated 5 years ago
- Jekyll theme for displaying a resume/CV in a clean, minimalistic way.☆10Jan 4, 2021Updated 5 years ago
- Attention Is All You Need (https://arxiv.org/abs/1706.03762)☆10Apr 26, 2018Updated 8 years ago
- 야자타임 (Yaja Time, a.k.a. late-night natural language processing time)☆27Mar 31, 2021Updated 5 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- Code and dataset for "Leveraging 2-hop Distant Supervision from Table Entity Pairs for Relation Extraction" (EMNLP'19)☆13May 18, 2020Updated 5 years ago
- ☆38Mar 17, 2018Updated 8 years ago
- ☆10Oct 4, 2024Updated last year
- An implementation of an autoregressive language model using an improved Transformer and DeepSpeed pipeline parallelism.☆29Jan 12, 2026Updated 3 months ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Mar 6, 2023Updated 3 years ago
- Text classification (specifically for Sentiment Analysis) using Deep Learning☆10Jun 9, 2016Updated 9 years ago
- The classic game "Snake" (in React+Redux)☆24Jan 25, 2019Updated 7 years ago
- Code inspired by Unsupervised Machine Translation Using Monolingual Corpora Only☆50Jul 25, 2024Updated last year
- PyTorch parameter server with MPI☆16Mar 22, 2018Updated 8 years ago
- Implementation of a dependency parser using neural networks☆11Mar 7, 2017Updated 9 years ago
- Top-down Tree LSTM (NAACL 2016) http://aclweb.org/anthology/N/N16/N16-1035.pdf☆83Nov 29, 2016Updated 9 years ago
- ☆182Aug 17, 2018Updated 7 years ago
- Text Style Transfer: A Review☆13Jun 1, 2019Updated 6 years ago
- Natural language dataset for training a Conversational Recommender System☆11Jul 9, 2019Updated 6 years ago
- ☆41Feb 12, 2019Updated 7 years ago
- Decompile Flash Project☆18Aug 19, 2020Updated 5 years ago
- TensorFlow/PyTorch implementation of Gated Attention Reader☆37May 9, 2017Updated 8 years ago