A PyTorch implementation of the Transformer model in "Attention is All You Need".
☆19Aug 29, 2018Updated 7 years ago
Alternatives and similar repositories for attention-is-all-you-need-pytorch
Users that are interested in attention-is-all-you-need-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI 2023] Official implementation of FiTs: Fine-grained Two-stage Training for Knowledge Base Question Answering☆11Mar 10, 2023Updated 3 years ago
- ☆11Aug 27, 2022Updated 3 years ago
- Code for UAI 2018 paper by Forré & Mooij (causal discovery with mSCMs using sigma-separation)☆10Aug 11, 2020Updated 5 years ago
- EMNLP 2020: Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots☆12Dec 15, 2020Updated 5 years ago
- ☆15Sep 28, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆13Jul 8, 2020Updated 5 years ago
- Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs (EMNLP 2024)☆16Nov 17, 2024Updated last year
- Automated Discovery of Interactions and Dynamics for Large Networked Dynamical Systems☆16Jun 15, 2021Updated 4 years ago
- 使用Pytorch Geometric(PyG)实现了Cora、Citeseer、Pubmed数据集上的GraphSAGE模型(full-batch)☆18May 18, 2024Updated last year
- Code used in our ijcai 2019 paper "Story Ending Prediction by Transferable BERT"☆24Nov 21, 2022Updated 3 years ago
- Code for paper: Weakly- and Semi-supervised Evidence Extraction☆15Apr 12, 2021Updated 5 years ago
- Image readout, processing and SLAM library☆11Jun 3, 2022Updated 3 years ago
- This is the latex template that should be used for the paper submission in ICASSP 2022☆12Sep 14, 2021Updated 4 years ago
- The top conferences on video retrieval libraries in recent years, synchronized with my blog.☆14Nov 27, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Sliding Convolutional Attention Network for Scene Text Recognition☆11Aug 31, 2018Updated 7 years ago
- ☆12May 5, 2024Updated last year
- ☆10Nov 21, 2023Updated 2 years ago
- Entity/Relation/Event extraction on ACE 2005 corpus☆14Mar 26, 2019Updated 7 years ago
- A simplified fine tune and deploy code based on bert for text matching.☆15Aug 12, 2019Updated 6 years ago
- Dataset for Image-Goal Navigation in Habitat☆12Feb 24, 2022Updated 4 years ago
- Implementation of a multi-turn Chain of Thought (CoT) reasoning system, powered by the Llama 3.1 70B model on Groq.☆18Sep 22, 2024Updated last year
- ROS packages for multi robot exploration, with custom set of parameters for efficient exploration in large environments☆10Oct 1, 2024Updated last year
- [TPAMI] Multi-modality Multi-attribute Contrastive Pre-training for Image Aesthetics Computing☆25Jul 3, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆21Feb 6, 2023Updated 3 years ago
- ☆48Mar 8, 2026Updated last month
- 这是GraphSAGE模型在Cora、Citeseer、Pubmed数据集上的复现代码。语言:PyTorch☆27Jan 12, 2021Updated 5 years ago
- ☆10Nov 16, 2023Updated 2 years ago
- Self-Supervised Dataset Distillation for Transfer Learning☆18Apr 10, 2024Updated 2 years ago
- Implementation of semi-supervised learning using PyTorch Lightning☆14Jul 25, 2024Updated last year
- H&M商品推荐比赛(rank: 116/2952 )方案☆14Jun 16, 2022Updated 3 years ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Dec 4, 2021Updated 4 years ago
- ☆12Jul 16, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 本项目是同济大学高级程序设计课程的第二次大作业——扫雷小游戏大作业,内含工程文件与课程报告。必须要说明的是,我上传这次作业的主要目的是抛砖引玉,以期学弟学妹在做作业的过程中少走弯路,报告内容也仅供参考,切勿全局抄袭,否则后果自负。如果认为这个工程有帮助的话,希望各位能给我点…☆15Jul 16, 2020Updated 5 years ago
- A collection of graph contrastive learning methods.☆18Apr 1, 2022Updated 4 years ago
- [NeurIPS 2020]. COPT - Coordinated Optimal Transport on Graphs☆17Dec 22, 2020Updated 5 years ago
- Official code for the ICLR 2025 paper, "Ada-K Routing: Boosting the Efficiency of MoE-based LLMs"☆12Mar 1, 2025Updated last year
- ☆18Apr 10, 2023Updated 3 years ago
- The official source code of our AAAI25 paper "D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matchin…☆10Feb 9, 2025Updated last year
- An PyTorch implementation of "Importance Weighted Actor-Learner Architectures" https://arxiv.org/abs/1802.01561☆12Jan 6, 2021Updated 5 years ago