在sts数据集上用多头注意力机制上进行测试。 pytorch torchtext 代码简练,非常适合新手了解多头注意力机制的运作。不想transformer牵扯很多层 multi-head attention + one layer linear
☆19Aug 20, 2025Updated 8 months ago
Alternatives and similar repositories for multi-head-self-attention
Users that are interested in multi-head-self-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 电报注册教程:2025年国内手机号注册Telegram收不到验证码怎么办?【已解决】本文还将会详细介绍如何下载、安装、注册和使用Telegram电报,并会为大家提供在中国使用Telegram 电报的一些实用建议☆19Apr 9, 2025Updated last year
- A Keras-based and TensorFlow-backend NLP Models Toolkit.☆12Jul 7, 2022Updated 3 years ago
- 中文文本的向量表示方法(Sentence-BERT, CoSENT)的PyTorch简单实现,可以用于文本相似度计算。☆10Mar 27, 2022Updated 4 years ago
- knrm文本相似度☆10Aug 1, 2020Updated 5 years ago
- 使用Few-Shot方法来做文本分类任务,基于THUCNews数据☆10Jun 4, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 天津财经大学2019年统计学院机器学习讨论班☆13Dec 9, 2019Updated 6 years ago
- Contrastive Distillation for Incremental Class Learning in Semantic Segmentation☆15Dec 13, 2021Updated 4 years ago
- bert_avg,bert_whitening,sbert,consert,simcse,esimcse 中文句向量表示☆15Apr 7, 2022Updated 4 years ago
- smplify code for point cloud based HMR☆10Jan 11, 2022Updated 4 years ago
- ☆19Feb 16, 2025Updated last year
- This GitHub repository provides an implementation of the paper "MAGNET: Multi-Label Text Classification using Attention-based Graph Neura…☆20Nov 2, 2023Updated 2 years ago
- ☆17Sep 10, 2021Updated 4 years ago
- 重构论文A Biterm Topic Model for Short Texts提供的源代码,编译成一个python 扩展模块,并用python 包装了一下,提供一个user-friendly python package☆11Apr 15, 2019Updated 7 years ago
- VectorNet Code Replication,Contains mini data datasets that can be run directly,The visualization content will be updated in the future☆14Jun 28, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆25Dec 23, 2019Updated 6 years ago
- 4th Year project aiming to implement PC, FCI and RFCI algorithms in python☆14Apr 29, 2019Updated 7 years ago
- 人工智能大作业:关于计算文本相似度的深度神经网络模型与算法研究分析(BERT、SentenceBERT、SimCSE)☆17Jul 11, 2022Updated 3 years ago
- This repository contains code for the method proposed in the paper: Two-stream Encoder-Decoder Network for Localizing Image Forgeries☆12Nov 12, 2021Updated 4 years ago
- Deep Learning Project for Trajectory Prediction using nuScenes dataset.☆10Sep 13, 2022Updated 3 years ago
- 2019 年第十六届中国研究生数学建模竞赛 A 题(华为赛题)☆14Sep 27, 2019Updated 6 years ago
- ☆13May 26, 2022Updated 3 years ago
- 中文公开聊天语料库☆11Nov 5, 2018Updated 7 years ago
- FastMCP for Google's langextract library☆30Aug 6, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- CUDA implementation of Wavelet KAN.☆17Jun 8, 2024Updated last year
- ☆16Oct 20, 2019Updated 6 years ago
- ☆12Jun 18, 2020Updated 5 years ago
- Code for the ICML 2013 and AAAI 2013 papers on ELLA☆13Mar 10, 2017Updated 9 years ago
- This repository is the implementation of trajectory prediction's methods, such as LaneGCN, Vectornet, TNT...☆14May 26, 2023Updated 2 years ago
- Segmentation of the JSRT - Chest Lung Nodules and Non-Nodules images data set using UNet, R2U-Net and DCAN☆10Jul 1, 2020Updated 5 years ago
- In this repository, I have used ARIMA,SARIMA,AutoArima,ANN,CNN and LSTMs for time series modelling and Anomaly detection has been done us…☆12Feb 8, 2021Updated 5 years ago
- This is the official codebase for KDD 2021 paper Generalized Zero-Shot Extreme Multi-Label Learning☆24Jul 25, 2022Updated 3 years ago
- CAiRE in DialDoc21: Data Augmentation for Information-SeekingDialogue System☆11May 24, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A Tensorflow LSTM spam detector utilizing GloVe word embeddings.☆12Nov 9, 2019Updated 6 years ago
- Energy-based Graph Convolutional Networks for Scoring Protein Docking Models☆12Aug 31, 2021Updated 4 years ago
- semantic similarity, word2vec + wmd, bert+wmd, pytorch☆31Jan 29, 2024Updated 2 years ago
- Example of using multiple GPUs with PyTorch DataParallel☆12Jan 28, 2020Updated 6 years ago
- Findings of EMNLP'22 | Trial2Vec: Zero-Shot Clinical Trial Document Similarity Search using Self-Supervision☆26Apr 11, 2024Updated 2 years ago
- MantraNet implemented by pytorch☆15Nov 17, 2020Updated 5 years ago
- Official Code for Merging Statistical Feature via Adaptive Gate for Improved Text Classification (AAAI2021)☆26Feb 5, 2022Updated 4 years ago