A PyTorch implementation of Attention is all you need
☆43Oct 16, 2018Updated 7 years ago
Alternatives and similar repositories for Transformer-PyTorch
Users that are interested in Transformer-PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An example of DyNet autobatching for the NIPS "how to code a paper" workshop☆12Dec 9, 2017Updated 8 years ago
- Parallel SGD, done locally and remote☆14May 19, 2016Updated 9 years ago
- A Python wrapper for the ROUGE summarization evaluation package☆14Aug 9, 2017Updated 8 years ago
- ☆25May 21, 2018Updated 7 years ago
- Pre-processing and training scripts for WMT 2017 ZH-EN translation task☆40Jun 7, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- chinese wwm masking and ngram masking based on jieba☆11Jul 25, 2019Updated 6 years ago
- Open solution to the Cdiscount’s Image Classification Challenge☆19Jun 22, 2022Updated 3 years ago
- Python wrapper for evaluating summarization quality by ROUGE package☆162May 25, 2020Updated 5 years ago
- 多语言降噪预训练模型MBart的中文生成任务☆11May 27, 2021Updated 4 years ago
- ☆16Apr 11, 2022Updated 4 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- 간단한 파이썬 🇰🇷 한글 조사처리 라이브러리 은/는 와/과 이/가 등을 처리합니다. PyPI에 배포한 오픈소스 프로젝트입니다.☆24Jul 6, 2021Updated 4 years ago
- PyTorch implementation of Transformer-based Neural Machine Translation☆78Dec 14, 2022Updated 3 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Semantic Parser with Execution☆28Sep 10, 2018Updated 7 years ago
- ☆17Oct 9, 2022Updated 3 years ago
- Tools for working with the S800 corpus☆12Sep 17, 2020Updated 5 years ago
- Code for the paper "Extreme Adaptation for Personalized Neural Machine Translation"☆42Sep 22, 2025Updated 6 months ago
- Attention Is All You Need (https://arxiv.org/abs/1706.03762)☆10Apr 26, 2018Updated 7 years ago
- ☆10Nov 15, 2020Updated 5 years ago
- Library for implementing RNNs with Theano☆11Mar 26, 2015Updated 11 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- Code and dataset for "Leveraging 2-hop Distant Supervision from Table Entity Pairs for Relation Extraction" (EMNLP'19)☆13May 18, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 트랜스포머 블록을 활용한 상품명 자연어처리 기반 카테고리 분류 모델☆10Dec 5, 2022Updated 3 years ago
- An implementation of an autoregressive language model using an improved Transformer and DeepSpeed pipeline parallelism.☆30Jan 12, 2026Updated 2 months ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Mar 6, 2023Updated 3 years ago
- 一台海外Linux服务器,一行代码, 便能实现翻墙。好用的话求个Star。☆10Dec 20, 2018Updated 7 years ago
- Text classification (specifically for Sentiment Analysis) using Deep Learning☆10Jun 9, 2016Updated 9 years ago
- The classic game "Snake" (in React+Redux)☆24Jan 25, 2019Updated 7 years ago
- Code inspired by Unsupervised Machine Translation Using Monolingual Corpora Only☆50Jul 25, 2024Updated last year
- Implementation of a dependency parser using neural networks☆11Mar 7, 2017Updated 9 years ago
- Top-down Tree LSTM (NAACL 2016) http://aclweb.org/anthology/N/N16/N16-1035.pdf☆83Nov 29, 2016Updated 9 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆182Aug 17, 2018Updated 7 years ago
- N/A☆18Aug 15, 2022Updated 3 years ago
- Natural language dataset for training a Conversational Recommender System☆11Jul 9, 2019Updated 6 years ago
- ☆41Feb 12, 2019Updated 7 years ago
- A Python Library for Consuming Transactions from Pro Sports Transactions (https://www.prosportstransactions.com)☆21Apr 1, 2026Updated last week
- Tensorflow/Pytorch implementation of Gated Attention Reader☆37May 9, 2017Updated 8 years ago
- A bert baseline for DocRED☆18Oct 12, 2022Updated 3 years ago