A Faster PyTorch Implementation of Multi-Head Self-Attention
☆76May 27, 2022Updated 3 years ago
Alternatives and similar repositories for multi-head_self-attention
Users that are interested in multi-head_self-attention are comparing it to the libraries listed below
- PyTorch solution of Vietnamese Named Entity Recognition task with Google AI's BERT model.☆23Dec 8, 2022Updated 3 years ago
- ☆11Jun 17, 2019Updated 6 years ago
- VNOnDB dataset extractor. This dataset can be used to build deep learning models for the Vietnamese handwritten text recognition problem…☆19Sep 8, 2021Updated 4 years ago
- Voice Face Association Learning Paper List☆17May 20, 2023Updated 2 years ago
- ☆29Nov 7, 2025Updated 4 months ago
- Tutorial for text classification with BERT, using HuggingFace's transformers.☆13Jan 15, 2020Updated 6 years ago
- [ICLR 2024] "Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality" by Xuxi Chen*, Yu Yang*, Zhangyang Wang, Baha…☆15May 18, 2024Updated last year
- 7th sem college stuff☆14Nov 16, 2022Updated 3 years ago
- Deep-HOSeq: Deep Higher-Order Sequence Fusion for Multimodal Sentiment Analysis.☆11Oct 19, 2020Updated 5 years ago
- Benchmarking Recommendation Abilities for Large Language Models☆31Mar 10, 2026Updated last week
- Deep learning network MEBCRN for separation of fat and water magnetic resonance images☆11Dec 29, 2020Updated 5 years ago
- An ObsPy library for event detection and seismic attribute calculation: preparing waveforms for automated analysis☆11Jan 17, 2023Updated 3 years ago
- An implementation of a Capsule Attention Network.☆10Jan 26, 2018Updated 8 years ago
- ☆14Jul 2, 2018Updated 7 years ago
- Chiller Fault Diagnosis based on VAE Enabled Generative Adversarial Networks☆46Jul 22, 2020Updated 5 years ago
- The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"☆24Oct 14, 2025Updated 5 months ago
- The stream-learn is an open-source Python library for difficult data stream analysis.☆66Aug 29, 2025Updated 6 months ago
- Codes used in the earthquake detection and location method presented in Beauce et al. 2019. A real data example is also provided.☆13Feb 2, 2021Updated 5 years ago
- One of the first implementations of Grad-CAM++ for time series / 1D signals.☆18Apr 9, 2023Updated 2 years ago
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- PyTorch implementation of simplified neural source filter model (s-nsf)☆14Aug 4, 2021Updated 4 years ago
- ☆14Aug 24, 2018Updated 7 years ago
- A CAM (Class Activation Mapping) demo for 3D medical images (PyTorch and 3D U-Net).☆14Nov 3, 2022Updated 3 years ago
- An explanation of Alibaba's ESMM model☆43Aug 6, 2020Updated 5 years ago
- Paper list☆23Oct 31, 2019Updated 6 years ago
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion☆104Mar 22, 2024Updated last year
- Multiscale reduction clustering of vibration signals for unsupervised diagnosis of machine faults☆17Nov 27, 2024Updated last year
- ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing (EMNLP-2020 Findings)☆36Jul 22, 2024Updated last year
- scMODAL: A general deep learning framework for single-cell Multi-Omics Data Alignment with feature Links☆21Jun 10, 2025Updated 9 months ago
- A temporary repo to share the DMBERT code for Event Detection☆13Apr 19, 2020Updated 5 years ago
- VQVAE | VAE | GumbelVAE | PixelCNN☆21Jun 15, 2020Updated 5 years ago
- Learning Interactions and Relationships between Movie Characters (CVPR'20)☆22Apr 12, 2023Updated 2 years ago
- ☆13Jun 26, 2024Updated last year
- Attempt on a Kaggle competition, Personalized Web Search Challenge, hosted by Yandex (http://www.kaggle.com/c/yandex-personalized-web-sea…☆11Jan 3, 2014Updated 12 years ago
- Repo for storing notes from Andrew Ng's Machine Learning course on Coursera☆18Dec 18, 2021Updated 4 years ago
- A PyTorch implementation of the DASP algorithm from the paper "Explaining Deep Neural Networks with a Polynomial Time Algorithm f…☆11Jun 12, 2020Updated 5 years ago
- [NeurIPS 2024] Official Implementation of "SDformer: Similarity-driven Discrete Transformer For Time Series Generation"☆13May 23, 2025Updated 9 months ago
- Creating a steering wheel angle predictor using Udacity's challenge 2 data.☆12Sep 26, 2017Updated 8 years ago
- ☆12May 19, 2024Updated last year