heartcored98 / transformer_anatomy
Official PyTorch implementation of "Roles and Utilization of Attention Heads in Transformer-based Neural Language Models" (ACL 2020)
☆16 · Mar 21, 2025 · Updated 10 months ago
Alternatives and similar repositories for transformer_anatomy
Users who are interested in transformer_anatomy are comparing it to the libraries listed below.
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021) ☆13 · Jun 2, 2021 · Updated 4 years ago
- ☆17 · May 25, 2020 · Updated 5 years ago
- ☆17 · May 14, 2020 · Updated 5 years ago
- LegalCrawler: A tool for automated scraping of English legal corpora ☆59 · Aug 18, 2022 · Updated 3 years ago
- ☆23 · Jun 18, 2021 · Updated 4 years ago
- Subword-level Word Vector Representations for Korean (ACL 2018) ☆107 · Oct 17, 2019 · Updated 6 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020 ☆63 · Apr 30, 2024 · Updated last year
- Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro… ☆62 · Sep 17, 2025 · Updated 5 months ago
- MultiCite code and data. Models are available on Huggingface. ☆33 · May 10, 2022 · Updated 3 years ago
- Large scale unannotated Korean corpus for unsupervised tasks. (e.g. Language modeling) ☆28 · Aug 11, 2019 · Updated 6 years ago
- ☆10 · Nov 6, 2020 · Updated 5 years ago
- Linear chain conditional random fields are implemented using Numpy and Mxnet/Gluon, and batch training is supported, not limited to train… ☆22 · Apr 5, 2019 · Updated 6 years ago
- The Stanford Word Substitution (Swords) Benchmark ☆32 · Mar 24, 2022 · Updated 3 years ago
- Pre-trained Machine Translation Models of Korean from/to ECJ ☆29 · Jul 15, 2019 · Updated 6 years ago
- BERT models for many languages created from Wikipedia texts ☆33 · May 25, 2020 · Updated 5 years ago
- Korean malicious comments dataset ☆73 · Sep 26, 2020 · Updated 5 years ago
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster] ☆22 · Dec 10, 2025 · Updated 2 months ago
- code for modular summarization work published in ACL2021 by Krishna et al ☆30 · Nov 4, 2021 · Updated 4 years ago
- “Welcome to my GitHub repository, a hub of exploration and innovation in the realm of data science. 📊💻 Here, you’ll find a curated coll… ☆10 · Apr 3, 2025 · Updated 10 months ago
- Code and performance tests to demonstrate the COUNTLESS algorithm. https://medium.com/@willsilversmith/countless-high-performance-2x-down… ☆10 · Oct 23, 2019 · Updated 6 years ago
- Curated list of awesome datasets for various table understanding tasks ☆18 · Sep 5, 2025 · Updated 5 months ago
- Analysis of the Pegida Facebook corpus ☆10 · Jan 31, 2015 · Updated 11 years ago
- Highly scalable integration and classification of single-cell RNA sequencing data ☆10 · Dec 27, 2020 · Updated 5 years ago
- Open Set Semantic Segmentation ☆10 · Dec 23, 2020 · Updated 5 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al. ☆98 · Aug 14, 2023 · Updated 2 years ago
- Add noise to your text, can be used to improve synthetic training corpus for Neural Machine Translation ☆41 · Aug 8, 2019 · Updated 6 years ago
- FaVIQ: Fact Verification from Information-seeking Questions ☆43 · Nov 23, 2022 · Updated 3 years ago
- Object-aware Contrastive Learning for Debiased Scene Representation (NeurIPS 2021) ☆45 · Oct 25, 2021 · Updated 4 years ago
- Jump to better conclusions: SCAN both left and right ☆11 · Jan 24, 2019 · Updated 7 years ago
- python project template for personal projects! 🙋♀️ ☆11 · Nov 28, 2020 · Updated 5 years ago
- Source codes for the paper "Local Additivity Based Data Augmentation for Semi-supervised NER" ☆43 · Oct 15, 2022 · Updated 3 years ago
- Codes for "Learning bounds for risk-sensitive learning," NeurIPS 2020 (or see arXiv 2006.08138) ☆11 · Oct 15, 2020 · Updated 5 years ago
- Online Hyperparameter Optimization ☆11 · Feb 17, 2021 · Updated 5 years ago
- Edit and Generate Anything in 3D world! ☆13 · Apr 15, 2023 · Updated 2 years ago
- (Unofficial) Tensorflow implementation of Adversarial Latent Autoencoder (ALAE, Pidhorskyi et al., 2020) ☆11 · Sep 8, 2020 · Updated 5 years ago
- Creative Instructions Project ☆11 · Sep 4, 2023 · Updated 2 years ago
- DenseShuffleNet for Semantic Segmentation using Caffe for Cityscapes and Mapillary Vistas Dataset ☆10 · Mar 21, 2018 · Updated 7 years ago
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24 ☆13 · Mar 2, 2024 · Updated last year
- A crawler-building project that aims to collect news from most newspapers ☆10 · Jul 29, 2019 · Updated 6 years ago