Transformer implementation in PyTorch.
☆493Mar 7, 2019Updated 7 years ago
Alternatives and similar repositories for transformer-pytorch
Users that are interested in transformer-pytorch are comparing it to the libraries listed below.
- tunz's CUDA pytorch operator (MaskedSoftmax)☆75Mar 7, 2019Updated 7 years ago
- using nn.Transformer() module to accomplish a machine learning demo.☆13Mar 23, 2022Updated 4 years ago
- A PyTorch implementation of the Transformer model in "Attention is All You Need".☆9,661Apr 16, 2024Updated last year
- Transformer seq2seq model, program that can build a language translator from parallel corpus☆1,427May 19, 2023Updated 2 years ago
- ☆12Apr 23, 2023Updated 2 years ago
- Mathematical modeling resources☆10Jul 10, 2023Updated 2 years ago
- Implementation of unregularized, l1 regularized and l2 regularized linear regression using numpy and without sklearn☆12Oct 4, 2019Updated 6 years ago
- A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"☆578Oct 1, 2020Updated 5 years ago
- This is a project to analyze korquad 2.0☆23Jun 22, 2022Updated 3 years ago
- trajectory prediction using NGSIM. ECE 228 Sp22☆14Jun 10, 2022Updated 3 years ago
- An annotated implementation of the Transformer paper.☆7,155Apr 7, 2024Updated last year
- pytorch implementation of Attention is all you need☆240Jun 16, 2021Updated 4 years ago
- A re-implementation of the CVPR19 paper Quantization Networks on CIFAR-10, MNIST and ImageNet☆10Aug 9, 2020Updated 5 years ago
- Tutorial for pretraining Korean GPT-2 model☆67Jun 12, 2023Updated 2 years ago
- A collection of Korean Text Datasets ready to use using Tensorflow-Datasets.☆20Jun 8, 2022Updated 3 years ago
- Download and create a tfreader for the audioset dataset☆16Apr 16, 2020Updated 5 years ago
- A PyTorch implementation of Transformer in "Attention is All You Need"☆106Dec 6, 2020Updated 5 years ago
- ☆13Jul 31, 2023Updated 2 years ago
- ☆15Jul 28, 2024Updated last year
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆24,996Mar 27, 2026Updated last week
- Materials for "Natural Language Processing for Multilingual Task-Oriented Dialogue" Tutorial at ACL 2022☆14May 21, 2022Updated 3 years ago
- This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot.☆18Apr 20, 2023Updated 2 years ago
- Label-Imbalanced and Group-Sensitive Classification under Overparameterization☆17Nov 3, 2021Updated 4 years ago
- ☆22Dec 31, 2019Updated 6 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.☆11Dec 15, 2022Updated 3 years ago
- The Transformer in PyTorch☆13Aug 7, 2024Updated last year
- Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.☆5,686Jan 20, 2024Updated 2 years ago
- Google AI 2018 BERT pytorch implementation☆6,522Sep 15, 2023Updated 2 years ago
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…☆158,637Updated this week
- ☆10Mar 28, 2022Updated 4 years ago
- ☆12,401Mar 3, 2026Updated last month
- PyTorch Implementation of "Non-Autoregressive Neural Machine Translation"☆271Feb 12, 2022Updated 4 years ago
- PlaNet: Learning Latent Dynamics for Planning from Pixels☆10Feb 13, 2020Updated 6 years ago
- ☆14Aug 3, 2021Updated 4 years ago
- A C++/CUDA toolkit for Transformer (NMT) Translator (Decoder)☆17Jan 7, 2019Updated 7 years ago
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".☆44Oct 29, 2021Updated 4 years ago
- Posterior Control of Blackbox Generation☆23May 2, 2020Updated 5 years ago
- ☆13Jun 1, 2017Updated 8 years ago