Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. 值得一读的注意力机制、Transformer和预训练语言模型论文与相关资源集合
☆130Mar 27, 2021Updated 5 years ago
Alternatives and similar repositories for ATPapers
Users that are interested in ATPapers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- BERT-related papers☆2,039Aug 12, 2023Updated 2 years ago
- ☆20Jun 6, 2021Updated 4 years ago
- list of efficient attention modules☆1,022Aug 23, 2021Updated 4 years ago
- Survey on Machine Reading Comprehension☆147Jan 26, 2021Updated 5 years ago
- ☆10Aug 22, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- COVID-19 Related NLP Papers☆30Jan 20, 2022Updated 4 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- ☆28Oct 21, 2019Updated 6 years ago
- Initializing Convolutional Filters with Semantic Features for Text Classification☆24May 13, 2018Updated 7 years ago
- ☆20Sep 16, 2017Updated 8 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Nov 7, 2021Updated 4 years ago
- Code to create pre-training data for a span selection pre-training task inspired by reading comprehension and an effort to avoid encoding…☆30May 20, 2022Updated 3 years ago
- ☆23Oct 15, 2022Updated 3 years ago
- 用bert4keras来解小学数学应用题☆77Oct 23, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆17Jul 6, 2020Updated 5 years ago
- Video Games Dataset for Multi-Document Summarization☆19Sep 20, 2025Updated 7 months ago
- Adversarial perturbations on word embeddings of BERT☆13Jan 17, 2021Updated 5 years ago
- Python wrapper for Yossi Rubner's implementation of the earth mover's distance (EMD)☆33Oct 16, 2012Updated 13 years ago
- This is a list of open-source projects at Microsoft Research NLP Group☆112Sep 29, 2020Updated 5 years ago
- Code for 'Contrastive Multi-Document Question Generation'☆11Oct 16, 2022Updated 3 years ago
- Contrastive Attention Mechanism for Abstractive Text Summarization☆40Jan 14, 2020Updated 6 years ago
- [NLPCC 2021] Shared Task on AutoIE2: Sub-Event Identification☆14Jul 19, 2021Updated 4 years ago
- 本文旨在整理文本生成领域国内外工业界和企业家的研究者和研究机构。排名不分先后。更新中,欢迎大家补充☆53Jan 4, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation for paper " Unsupervised Domain Adaptation on Reading Comprehension "☆30May 21, 2020Updated 5 years ago
- 天池-新冠疫情相似句对判定大赛 大白_Rank6☆21Apr 8, 2020Updated 6 years ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 7 months ago
- Adversarial Adaptation with Distillation for BERT Unsupervised Domain Adaptation☆35Oct 27, 2020Updated 5 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆60Jun 1, 2020Updated 5 years ago
- Turn GitHub into an RSS reader☆25Jan 1, 2024Updated 2 years ago
- 国内外数据竞赛资讯整理☆18Nov 6, 2021Updated 4 years ago
- Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。☆2,663May 30, 2023Updated 2 years ago
- Understanding the Difficulty of Training Transformers☆332May 31, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers☆165Jun 12, 2022Updated 3 years ago
- ☆14Aug 18, 2022Updated 3 years ago
- ACL2020 Tutorial: Open-Domain Question Answering☆835Jan 1, 2021Updated 5 years ago
- roBERTa training for SQuAD☆50Mar 2, 2020Updated 6 years ago
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…☆13Jan 26, 2025Updated last year
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Jul 22, 2021Updated 4 years ago
- ☆23Dec 8, 2022Updated 3 years ago