在sts数据集上用多头注意力机制上进行测试。 pytorch torchtext 代码简练,非常适合新手了解多头注意力机制的运作。不想transformer牵扯很多层 multi-head attention + one layer linear
☆19Aug 20, 2025Updated 7 months ago
Alternatives and similar repositories for multi-head-self-attention
Users that are interested in multi-head-self-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 中文文本的向量表示方法(Sentence-BERT, CoSENT)的PyTorch简单实现,可以用于文本相似度计算。☆10Mar 27, 2022Updated 4 years ago
- Code for my master thesis on hierarchical probabilistic forecasting of smart meter time series using weather input.☆11Aug 21, 2022Updated 3 years ago
- 使用Few-Shot方法来做文本分类任务,基于THUCNews数据☆10Jun 4, 2020Updated 5 years ago
- bert_avg,bert_whitening,sbert,consert,simcse,esimcse 中文句向量表示☆15Apr 7, 2022Updated 4 years ago
- Hierarchical Time Series Forecasting☆15Feb 8, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- smplify code for point cloud based HMR☆10Jan 11, 2022Updated 4 years ago
- Implementation of the paper : Not all attention is needed - Gated Attention Network for Sequence Data (GA-Net) [https://arxiv.org/abs/191…☆13Aug 20, 2020Updated 5 years ago
- ☆17Feb 16, 2025Updated last year
- Recod.ai Scientific Image Integrity Library☆13Mar 27, 2025Updated last year
- 地图选点一键导出公交到达圈的小工具☆11Apr 4, 2022Updated 4 years ago
- 利用爬虫获取北京公交路径,数据来源:高德地图☆12May 13, 2024Updated last year
- ☆17Sep 10, 2021Updated 4 years ago
- The offical Pytorch code for "Continual Attentive Fusion for Incremental Learning in Semantic Segmentation"☆16Apr 8, 2022Updated 4 years ago
- 中文领域的多模态Bert☆47Mar 24, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆25Dec 23, 2019Updated 6 years ago
- 4th Year project aiming to implement PC, FCI and RFCI algorithms in python☆14Apr 29, 2019Updated 6 years ago
- Try to realize the algorithm "support tensor machine"☆12May 9, 2022Updated 3 years ago
- ☆13May 26, 2022Updated 3 years ago
- Official Implementation of "Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach"☆36Apr 6, 2026Updated last week
- 中文公开聊天语料库☆11Nov 5, 2018Updated 7 years ago
- Google Research☆10Apr 20, 2022Updated 3 years ago
- CUDA implementation of Wavelet KAN.☆17Jun 8, 2024Updated last year
- ☆16Oct 20, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Classifying Forged vs Authentic using Domain Adaptation across in new domains in unsupervised settings☆15May 6, 2020Updated 5 years ago
- Code for the ICML 2013 and AAAI 2013 papers on ELLA☆13Mar 10, 2017Updated 9 years ago
- CAiRE in DialDoc21: Data Augmentation for Information-SeekingDialogue System☆11May 24, 2022Updated 3 years ago
- Example of using multiple GPUs with PyTorch DataParallel☆12Jan 28, 2020Updated 6 years ago
- semantic similarity, word2vec + wmd, bert+wmd, pytorch☆31Jan 29, 2024Updated 2 years ago
- MantraNet implemented by pytorch☆15Nov 17, 2020Updated 5 years ago
- Use numpy to build neuron network☆11May 17, 2022Updated 3 years ago
- ☆13Oct 4, 2022Updated 3 years ago
- Pytorch implementation of various traffic prediction modules(FC-LSTM, GRU, GCN, Diffusion Conv, Temporal Attention, etc.)☆12Jan 24, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Multi-modal RAG system with knowledge graph indexing. Upload anything, query everything.☆29Dec 23, 2025Updated 3 months ago
- Deep reinforcement learning for protein complex modeling☆11May 28, 2022Updated 3 years ago
- A structurally comprehensive dataset of AMR-to-text alignments for coverage of a larger variety of linguistic phenomena, for research rel…☆16Dec 10, 2022Updated 3 years ago
- ☆19Mar 12, 2025Updated last year
- The implementation and addtional material of AAAI2020 paper "Stable Learning via Sample Reweighting"☆19Mar 26, 2020Updated 6 years ago
- Copy–Move forgery or Cloning is a type of Image tampering where a part of the image is copied and pasted on another part of same image…☆18Oct 23, 2021Updated 4 years ago
- 复现热门网络Alexnet,VGG,Inception,YoLo等CSDN地址:https://blog.csdn.net/shankezh☆14Jun 20, 2019Updated 6 years ago