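Tests a multi-head attention mechanism on the STS dataset, using PyTorch and torchtext. The code is concise and well suited to beginners who want to understand how multi-head attention works. Unlike a Transformer, it does not involve many layers: just multi-head attention + one linear layer.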
☆19Aug 20, 2025Updated 9 months ago
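As a rough illustration of the "multi-head attention + one linear layer" architecture described above, here is a minimal PyTorch sketch. It is not the repository's code: the class name `AttentionRegressor`, the mean pooling, the sentence-pair input, and all hyperparameters are assumptions made for this example, and it assumes STS is used as a sentence-pair similarity task with token id 0 reserved for padding.

```python
import torch
import torch.nn as nn


class AttentionRegressor(nn.Module):
    """Hypothetical model: embedding -> multi-head self-attention -> one linear layer."""

    def __init__(self, vocab_size, embed_dim=64, num_heads=4):
        super().__init__()
        # Assumption: token id 0 is the padding token.
        self.embed = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        # The single linear layer maps the two pooled sentence vectors to one score.
        self.out = nn.Linear(2 * embed_dim, 1)

    def encode(self, tokens):
        x = self.embed(tokens)                       # (batch, seq_len, embed_dim)
        pad_mask = tokens.eq(0)                      # True where the token is padding
        # Self-attention: query, key, and value are all the same sequence.
        attended, _ = self.attn(x, x, x, key_padding_mask=pad_mask)
        # Mean-pool over the non-padding positions only.
        keep = (~pad_mask).unsqueeze(-1).float()
        return (attended * keep).sum(dim=1) / keep.sum(dim=1).clamp(min=1.0)

    def forward(self, sent_a, sent_b):
        pair = torch.cat([self.encode(sent_a), self.encode(sent_b)], dim=-1)
        return self.out(pair).squeeze(-1)            # one similarity score per pair


torch.manual_seed(0)
model = AttentionRegressor(vocab_size=100)
# Two toy sentence pairs, already converted to token ids and padded with 0.
sent_a = torch.tensor([[5, 8, 13, 0], [7, 2, 0, 0]])
sent_b = torch.tensor([[5, 9, 21, 4], [3, 0, 0, 0]])
scores = model(sent_a, sent_b)
print(scores.shape)  # torch.Size([2])
```

In a real setup the token ids would come from a vocabulary built over the dataset (for example with torchtext), and the scores would be trained against the gold similarity labels with a regression loss such as `nn.MSELoss`.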
Alternatives and similar repositories for multi-head-self-attention
Users that are interested in multi-head-self-attention are comparing it to the libraries listed below.
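- Telegram registration tutorial: what to do in 2025 when a mainland China phone number does not receive the Telegram verification code [Solved]. The article also explains in detail how to download, install, register for, and use Telegram, and offers practical advice for using Telegram in China☆19Apr 9, 2025Updated last year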
- A Keras-based and TensorFlow-backend NLP Models Toolkit.☆12Jul 7, 2022Updated 3 years ago
- A simple PyTorch implementation of vector representation methods for Chinese text (Sentence-BERT, CoSENT), usable for text similarity computation.☆10Mar 27, 2022Updated 4 years ago
- Code for my master thesis on hierarchical probabilistic forecasting of smart meter time series using weather input.☆11Aug 21, 2022Updated 3 years ago
- Text classification with few-shot methods, based on the THUCNews dataset☆10Jun 4, 2020Updated 5 years ago
- Machine learning seminar of the School of Statistics, Tianjin University of Finance and Economics, 2019☆13Dec 9, 2019Updated 6 years ago
- Project: Efficient upscaling of geologic model based on theory-guided encoder-decoder☆12Jun 23, 2021Updated 4 years ago
- Contrastive Distillation for Incremental Class Learning in Semantic Segmentation☆14Dec 13, 2021Updated 4 years ago
- Chinese sentence embeddings: bert_avg, bert_whitening, sbert, consert, simcse, esimcse☆15Apr 7, 2022Updated 4 years ago
- ☆13Jun 22, 2021Updated 4 years ago
- Hierarchical Time Series Forecasting☆15Feb 8, 2023Updated 3 years ago
- smplify code for point cloud based HMR☆10Jan 11, 2022Updated 4 years ago
- ☆20Feb 16, 2025Updated last year
- This GitHub repository provides an implementation of the paper "MAGNET: Multi-Label Text Classification using Attention-based Graph Neura…☆20Nov 2, 2023Updated 2 years ago
- A small tool for picking a point on a map and exporting its public-transit reachability area in one click☆11Apr 4, 2022Updated 4 years ago
- Uses NLP to extract information from legal documents, providing text-based search suggestions, similar-article recommendations, keyword extraction, and part-of-speech tagging, presented through a web page.☆28Feb 22, 2022Updated 4 years ago
- Replication material for "Forecasting Hierarchical Time Series with a Regularized Embedding Space," KDD MileTS 2020☆13Mar 1, 2023Updated 3 years ago
- ☆25Dec 23, 2019Updated 6 years ago
- 4th Year project aiming to implement PC, FCI and RFCI algorithms in python☆14Apr 29, 2019Updated 7 years ago
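- Artificial intelligence course project: research and analysis of deep neural network models and algorithms for computing text similarity (BERT, SentenceBERT, SimCSE)☆17Jul 11, 2022Updated 3 years ago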
- This repository contains code for the method proposed in the paper: Two-stream Encoder-Decoder Network for Localizing Image Forgeries☆12Nov 12, 2021Updated 4 years ago
- An interpretable probabilistic model for short-term solar power forecasting using natural gradient boosting☆15Sep 28, 2021Updated 4 years ago
- ☆13May 26, 2022Updated 4 years ago
- Publicly available Chinese chat corpora☆11Nov 5, 2018Updated 7 years ago
- Google Research☆10Apr 20, 2022Updated 4 years ago
- FastMCP for Google's langextract library☆30Aug 6, 2025Updated 9 months ago
- ☆16Oct 20, 2019Updated 6 years ago
- Classifying forged vs. authentic using domain adaptation across new domains in unsupervised settings☆15May 6, 2020Updated 6 years ago
- ☆12Jun 18, 2020Updated 5 years ago
- Copy-Move Image Forgery Detection☆11Oct 2, 2019Updated 6 years ago
- Code for the ICML 2013 and AAAI 2013 papers on ELLA☆13Mar 10, 2017Updated 9 years ago
- This repository is the implementation of trajectory prediction's methods, such as LaneGCN, Vectornet, TNT...☆15May 26, 2023Updated 3 years ago
- CAiRE in DialDoc21: Data Augmentation for Information-Seeking Dialogue System☆11May 24, 2022Updated 4 years ago
- Conditional Variational Autoencoder for the prediction of site-specific recombinases selective for a specified target site☆12Jun 23, 2023Updated 2 years ago
- A Tensorflow LSTM spam detector utilizing GloVe word embeddings.☆12Nov 9, 2019Updated 6 years ago
- Energy-based Graph Convolutional Networks for Scoring Protein Docking Models☆12Aug 31, 2021Updated 4 years ago
- Example of using multiple GPUs with PyTorch DataParallel☆12Jan 28, 2020Updated 6 years ago
- Use NumPy to build a neural network☆11May 17, 2022Updated 4 years ago
- Detection of copy-move forgeries in images using image-signal representation methods☆12Apr 28, 2018Updated 8 years ago