在sts数据集上用多头注意力机制上进行测试。 pytorch torchtext 代码简练,非常适合新手了解多头注意力机制的运作。不想transformer牵扯很多层 multi-head attention + one layer linear
☆18Aug 20, 2025Updated 7 months ago
Alternatives and similar repositories for multi-head-self-attention
Users that are interested in multi-head-self-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Keras-based and TensorFlow-backend NLP Models Toolkit.☆12Jul 7, 2022Updated 3 years ago
- 中文文本的向量表示方法(Sentence-BERT, CoSENT)的PyTorch简单实现,可以用于文本相似度计算。☆10Mar 27, 2022Updated 4 years ago
- This project hosts the code for implementing the SOLOv2 algorithms based on the official project(https://github.com/WXinlong/SOLO). Due t…☆17Feb 23, 2022Updated 4 years ago
- 使用Few-Shot方法来做文本分类任务,基于THUCNews数据☆10Jun 4, 2020Updated 5 years ago
- Contrastive Distillation for Incremental Class Learning in Semantic Segmentation☆15Dec 13, 2021Updated 4 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆13Jun 22, 2021Updated 4 years ago
- Hierarchical Time Series Forecasting☆15Feb 8, 2023Updated 3 years ago
- smplify code for point cloud based HMR☆10Jan 11, 2022Updated 4 years ago
- Implementation of the paper : Not all attention is needed - Gated Attention Network for Sequence Data (GA-Net) [https://arxiv.org/abs/191…☆13Aug 20, 2020Updated 5 years ago
- ☆17Feb 16, 2025Updated last year
- Replication material for "Forecasting Hierarchical Time Series with a Regularized Embedding Space," KDD MileTS 2020☆13Mar 1, 2023Updated 3 years ago
- The offical Pytorch code for "Continual Attentive Fusion for Incremental Learning in Semantic Segmentation"☆16Apr 8, 2022Updated 3 years ago
- 重构论文A Biterm Topic Model for Short Texts提供的源代码,编译成一个python 扩展模块,并用python 包装了一下,提供一个user-friendly python package☆11Apr 15, 2019Updated 6 years ago
- 中文领域的多模态Bert☆46Mar 24, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- 人工智能大作业:关于计算文本相似度的深度神经网络模型与算法研究分析(BERT、SentenceBERT、SimCSE)☆17Jul 11, 2022Updated 3 years ago
- 4th Year project aiming to implement PC, FCI and RFCI algorithms in python☆14Apr 29, 2019Updated 6 years ago
- ☆12Apr 6, 2021Updated 4 years ago
- Try to realize the algorithm "support tensor machine"☆12May 9, 2022Updated 3 years ago
- Google Research☆10Apr 20, 2022Updated 3 years ago
- Classifying Forged vs Authentic using Domain Adaptation across in new domains in unsupervised settings☆15May 6, 2020Updated 5 years ago
- ☆12Jun 18, 2020Updated 5 years ago
- Copy-Move Image Forgery Detection☆11Oct 2, 2019Updated 6 years ago
- A GAN architecture conditioned on Action Units (AU) annotations generating facial expressions in a continuous domain.☆11Nov 22, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the ICML 2013 and AAAI 2013 papers on ELLA☆13Mar 10, 2017Updated 9 years ago
- Segmentation of the JSRT - Chest Lung Nodules and Non-Nodules images data set using UNet, R2U-Net and DCAN☆10Jul 1, 2020Updated 5 years ago
- CAiRE in DialDoc21: Data Augmentation for Information-SeekingDialogue System☆11May 24, 2022Updated 3 years ago
- Use numpy to build neuron network☆11May 17, 2022Updated 3 years ago
- ☆13Oct 4, 2022Updated 3 years ago
- Deep reinforcement learning for protein complex modeling☆11May 28, 2022Updated 3 years ago
- The implementation and addtional material of AAAI2020 paper "Stable Learning via Sample Reweighting"☆19Mar 26, 2020Updated 6 years ago
- 复现热门网络Alexnet,VGG,Inception,YoLo等CSDN地址:https://blog.csdn.net/shankezh☆14Jun 20, 2019Updated 6 years ago
- 文本相似性☆23Aug 21, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Calculate the RMSD between two protein structures☆12Jun 29, 2022Updated 3 years ago
- AMR-parser. Code for EMNLP2019 paper "Core Semantic First: A Top-down Approach for AMR Parsing."☆11Feb 23, 2020Updated 6 years ago
- Mojuan: Write your own AI application.☆16Jul 12, 2024Updated last year
- 📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer…☆10Dec 4, 2020Updated 5 years ago
- ☆13Oct 3, 2017Updated 8 years ago
- ☆13Jun 15, 2021Updated 4 years ago
- [TMLR] Lightweight Learner for Shared Knowledge Lifelong Learning☆28Jun 7, 2024Updated last year