ZhengZixiang / ATPapersView external linksLinks
Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. 值得一读的注意力机制、Transformer和预训练语言模型论文与相关资源集合
☆130Mar 27, 2021Updated 4 years ago
Alternatives and similar repositories for ATPapers
Users that are interested in ATPapers are comparing it to the libraries listed below
Sorting:
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- Notes of my introduction about NLP in Fudan University☆37Jul 6, 2021Updated 4 years ago
- BERT-related papers☆2,042Aug 12, 2023Updated 2 years ago
- COVID-19 Related NLP Papers☆30Jan 20, 2022Updated 4 years ago
- Survey on Machine Reading Comprehension☆147Jan 26, 2021Updated 5 years ago
- list of efficient attention modules☆1,022Aug 23, 2021Updated 4 years ago
- ☆17Jul 6, 2020Updated 5 years ago
- 国内外数据竞赛资讯整理☆18Nov 6, 2021Updated 4 years ago
- 个人所需整理的自然语言处理资源集合☆71Mar 27, 2021Updated 4 years ago
- A simple implementation of a deep linear Pytorch module☆21Oct 16, 2020Updated 5 years ago
- Video Games Dataset for Multi-Document Summarization☆19Sep 20, 2025Updated 4 months ago
- Revisiting Parameter Sharing for Automatic Neural Channel Number Search, NeurIPS 2020☆22Nov 15, 2020Updated 5 years ago
- UNF(Universal NLP Framework)☆71Mar 6, 2020Updated 5 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Nov 7, 2021Updated 4 years ago
- codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients☆10May 27, 2021Updated 4 years ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 4 months ago
- The Python solutions of leetcode☆13Apr 26, 2020Updated 5 years ago
- Code for 'Contrastive Multi-Document Question Generation'☆11Oct 16, 2022Updated 3 years ago
- ☆10Aug 22, 2023Updated 2 years ago
- Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acousti…☆14Nov 12, 2020Updated 5 years ago
- Must-read papers on improving efficiency for pre-trained language models.☆105Oct 21, 2022Updated 3 years ago
- Awesome Transformers (self-attention) in Computer Vision☆269Jul 31, 2021Updated 4 years ago
- The Dataset and Official Implementation for <The ELCo Dataset: Bridging Emoji and Lexical Composition> @ LREC-COLING 2024☆16May 11, 2024Updated last year
- Gradually Updated Neural Networks for Large-Scale Image Recognition at ICML 2018☆10Jun 25, 2018Updated 7 years ago
- Official code of "NAS acceleration via proxy data", IJCAI21☆10May 29, 2022Updated 3 years ago
- ☆20Jun 6, 2021Updated 4 years ago
- NSAS code for CVPR review☆27Jun 2, 2021Updated 4 years ago
- Implementation for paper " Unsupervised Domain Adaptation on Reading Comprehension "☆30May 21, 2020Updated 5 years ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆34Jan 16, 2026Updated 3 weeks ago
- The programming assignments of Natural Language Processing by Michael Collins on Coursera☆14Apr 28, 2013Updated 12 years ago
- This is a niche collection of research papers which are proven to be gradients pushing the field of Natural Language Processing, Deep Lea…☆25Nov 19, 2024Updated last year
- ☆13Jun 28, 2021Updated 4 years ago
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation☆12Jul 31, 2023Updated 2 years ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Jul 22, 2021Updated 4 years ago
- ☆12Apr 25, 2022Updated 3 years ago
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…☆12Jan 26, 2025Updated last year
- ☆13Mar 16, 2022Updated 3 years ago
- ☆12May 22, 2022Updated 3 years ago
- ☆13Apr 9, 2018Updated 7 years ago