在sts数据集上用多头注意力机制上进行测试。 pytorch torchtext 代码简练,非常适合新手了解多头注意力机制的运作。不想transformer牵扯很多层 multi-head attention + one layer linear
☆18Aug 20, 2025Updated 6 months ago
Alternatives and similar repositories for multi-head-self-attention
Users that are interested in multi-head-self-attention are comparing it to the libraries listed below
Sorting:
- Code for my master thesis on hierarchical probabilistic forecasting of smart meter time series using weather input.☆11Aug 21, 2022Updated 3 years ago
- 中文文本的向量表示方法(Sentence-BERT, CoSENT)的PyTorch简单实现,可以用于文本相似度计算。☆10Mar 27, 2022Updated 3 years ago
- 中文公开聊天语料库☆11Nov 5, 2018Updated 7 years ago
- Trained 50epochs☆12Aug 22, 2023Updated 2 years ago
- ☆12Oct 4, 2022Updated 3 years ago
- knrm文本相似度☆10Aug 1, 2020Updated 5 years ago
- A GAN architecture conditioned on Action Units (AU) annotations generating facial expressions in a continuous domain.☆11Nov 22, 2022Updated 3 years ago
- FastMCP for Google's langextract library☆28Aug 6, 2025Updated 7 months ago
- ☆13Jun 15, 2021Updated 4 years ago
- Contrastive Distillation for Incremental Class Learning in Semantic Segmentation☆15Dec 13, 2021Updated 4 years ago
- Hierarchical Time Series Forecasting☆15Feb 8, 2023Updated 3 years ago
- Conditional Variational Autoencoder for the prediction of site-specific recombinases selective for a specified target site☆12Jun 23, 2023Updated 2 years ago
- This repository contains code for the method proposed in the paper: Two-stream Encoder-Decoder Network for Localizing Image Forgeries☆12Nov 12, 2021Updated 4 years ago
- ☆13Jun 22, 2021Updated 4 years ago
- A Keras-based and TensorFlow-backend NLP Models Toolkit.☆12Jul 7, 2022Updated 3 years ago
- 中文领域的多模态Bert☆46Mar 24, 2020Updated 5 years ago
- 📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer…☆10Dec 4, 2020Updated 5 years ago
- Deep reinforcement learning for protein complex modeling☆11May 28, 2022Updated 3 years ago
- A structurally comprehensive dataset of AMR-to-text alignments for coverage of a larger variety of linguistic phenomena, for research rel…☆16Dec 10, 2022Updated 3 years ago
- Calculate the RMSD between two protein structures☆12Jun 29, 2022Updated 3 years ago
- AMR-parser. Code for EMNLP2019 paper "Core Semantic First: A Top-down Approach for AMR Parsing."☆11Feb 23, 2020Updated 6 years ago
- Energy-based Graph Convolutional Networks for Scoring Protein Docking Models☆12Aug 31, 2021Updated 4 years ago
- ☆15Oct 20, 2019Updated 6 years ago
- RLMM is a reinforcement learning env for molecular modeling (currently only protein-ligand docking).☆11Nov 14, 2022Updated 3 years ago
- Example of using multiple GPUs with PyTorch DataParallel☆12Jan 28, 2020Updated 6 years ago
- Official PyTorch Implementation of "Self-Guided Generation of Minority Samples Using Diffusion Models" (ECCV 2024)☆14May 5, 2025Updated 10 months ago
- Implementation of the paper : Not all attention is needed - Gated Attention Network for Sequence Data (GA-Net) [https://arxiv.org/abs/191…☆13Aug 20, 2020Updated 5 years ago
- Replication material for "Forecasting Hierarchical Time Series with a Regularized Embedding Space," KDD MileTS 2020☆13Mar 1, 2023Updated 3 years ago
- Predicting Biomedical Interactions with Higher-Order Graph Convolutional Networks☆16Nov 9, 2021Updated 4 years ago
- Code for the ICML 2013 and AAAI 2013 papers on ELLA☆13Mar 10, 2017Updated 8 years ago
- 常用命令☆17Oct 7, 2024Updated last year
- ☆18Jun 5, 2024Updated last year
- ☆15Updated this week
- ☆17Sep 28, 2021Updated 4 years ago
- Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos☆16May 23, 2023Updated 2 years ago
- Pytorch implementation of Transformer-TTS for converting text into speech.☆19Jul 9, 2021Updated 4 years ago
- WebUI extension for InteractDiffusion☆18Mar 11, 2024Updated last year
- BiLSTM+CRF by Pytorch and classic CRF by pysuite 基于双向循环神经网络和CRF特征模板的信息抽取☆17Jan 9, 2019Updated 7 years ago
- Relative and Absolute Location Embedding for Few-Shot Node Classification on Graph☆13Nov 13, 2022Updated 3 years ago