在sts数据集上用多头注意力机制上进行测试。 pytorch torchtext 代码简练,非常适合新手了解多头注意力机制的运作。不想transformer牵扯很多层 multi-head attention + one layer linear
☆19Aug 20, 2025Updated 9 months ago
Alternatives and similar repositories for multi-head-self-attention
Users that are interested in multi-head-self-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 中文文本的向量表示方法(Sentence-BERT, CoSENT)的PyTorch简单实现,可以用于文本相似度计算。☆10Mar 27, 2022Updated 4 years ago
- knrm文本相似度☆10Aug 1, 2020Updated 5 years ago
- Code for my master thesis on hierarchical probabilistic forecasting of smart meter time series using weather input.☆11Aug 21, 2022Updated 3 years ago
- Contrastive Distillation for Incremental Class Learning in Semantic Segmentation☆14Dec 13, 2021Updated 4 years ago
- ☆13Jun 22, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of the paper : Not all attention is needed - Gated Attention Network for Sequence Data (GA-Net) [https://arxiv.org/abs/191…☆13Aug 20, 2020Updated 5 years ago
- ☆21Feb 16, 2025Updated last year
- This GitHub repository provides an implementation of the paper "MAGNET: Multi-Label Text Classification using Attention-based Graph Neura…☆21Nov 2, 2023Updated 2 years ago
- ☆17Sep 10, 2021Updated 4 years ago
- 地图选点一键导出公交到达圈的小工具☆11Apr 4, 2022Updated 4 years ago
- 利用爬虫获取北京公交路径,数据来源:高德地图☆12May 13, 2024Updated 2 years ago
- 利用NLP对法律文书信息提取,实现基于文本的搜索建议,相似文章推荐,关键词提取,词性标注。并用网页呈现。☆28Feb 22, 2022Updated 4 years ago
- The offical Pytorch code for "Continual Attentive Fusion for Incremental Learning in Semantic Segmentation"☆16Apr 8, 2022Updated 4 years ago
- VectorNet Code Replication,Contains mini data datasets that can be run directly,The visualization content will be updated in the future☆14Jun 28, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆25Dec 23, 2019Updated 6 years ago
- 4th Year project aiming to implement PC, FCI and RFCI algorithms in python☆14Apr 29, 2019Updated 7 years ago
- 人工智能大作业:关于计算文本相似度的深度神经网络模型与算法 研究分析(BERT、SentenceBERT、SimCSE)☆17Jul 11, 2022Updated 3 years ago
- Try to realize the algorithm "support tensor machine"☆12May 9, 2022Updated 4 years ago
- An interpretable probabilistic model for short-term solar power forecasting using natural gradient boosting☆15Sep 28, 2021Updated 4 years ago
- ☆13May 26, 2022Updated 4 years ago
- 中文公开聊天语料库☆11Nov 5, 2018Updated 7 years ago
- Official Implementation of "Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach"☆39Apr 6, 2026Updated 2 months ago
- CUDA implementation of Wavelet KAN.☆17Jun 8, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Classifying Forged vs Authentic using Domain Adaptation across in new domains in unsupervised settings☆15May 6, 2020Updated 6 years ago
- ☆12Jun 18, 2020Updated 6 years ago
- Code for the ICML 2013 and AAAI 2013 papers on ELLA☆13Mar 10, 2017Updated 9 years ago
- ☆15Oct 16, 2024Updated last year
- This repository is the implementation of trajectory prediction's methods, such as LaneGCN, Vectornet, TNT...☆15May 26, 2023Updated 3 years ago
- Energy-based Graph Convolutional Networks for Scoring Protein Docking Models☆12Aug 31, 2021Updated 4 years ago
- Example of using multiple GPUs with PyTorch DataParallel☆12Jan 28, 2020Updated 6 years ago
- Findings of EMNLP'22 | Trial2Vec: Zero-Shot Clinical Trial Document Similarity Search using Self-Supervision☆26Apr 11, 2024Updated 2 years ago
- Official Code for Merging Statistical Feature via Adaptive Gate for Improved Text Classification (AAAI2021)☆26Feb 5, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Use numpy to build neuron network☆11May 17, 2022Updated 4 years ago
- ☆13Oct 4, 2022Updated 3 years ago
- Pytorch implementation of various traffic prediction modules(FC-LSTM, GRU, GCN, Diffusion Conv, Temporal Attention, etc.)☆12Jan 24, 2024Updated 2 years ago
- Deep reinforcement learning for protein complex modeling☆11May 28, 2022Updated 4 years ago
- A structurally comprehensive dataset of AMR-to-text alignments for coverage of a larger variety of linguistic phenomena, for research rel…☆16Dec 10, 2022Updated 3 years ago
- Multi-modal RAG system with knowledge graph indexing. Upload anything, query everything.☆32Dec 23, 2025Updated 5 months ago
- 复现热门网络Alexnet,VGG,Inception,YoLo等CSDN地址:https://blog.csdn.net/shankezh☆13Jun 20, 2019Updated 6 years ago