Code for the ACL 2022 paper "Contextual Representation Learning beyond Masked Language Modeling"
☆33 · Oct 23, 2022 · Updated 3 years ago
Alternatives and similar repositories for TACO
Users who are interested in TACO compare it to the libraries listed below.
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe… ☆15 · Oct 18, 2022 · Updated 3 years ago
- [NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings ☆22 · Jan 30, 2023 · Updated 3 years ago
- Official Code for 'EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification' - NAACL 2022 ☆23 · May 9, 2022 · Updated 3 years ago
- [NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding ☆10 · Jul 15, 2023 · Updated 2 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech" ☆13 · Jul 27, 2023 · Updated 2 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/ ☆13 · Jul 1, 2024 · Updated last year
- Code of ACL 2022 paper Debiased Contrastive Learning of Unsupervised Sentence Representations ☆32 · Mar 16, 2022 · Updated 3 years ago
- ☆14 · Feb 3, 2021 · Updated 5 years ago
- PathPiece tokenizer ☆13 · Nov 10, 2024 · Updated last year
- German Language Understanding Evaluation Benchmark @NAACL24 ☆22 · Dec 11, 2025 · Updated 2 months ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning ☆41 · May 5, 2021 · Updated 4 years ago
- [EMNLP 2021] Efficient Contrastive Learning via Novel Data Augmentation and Curriculum Learning ☆17 · Jun 28, 2025 · Updated 8 months ago
- Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models ☆16 · Sep 13, 2021 · Updated 4 years ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆48 · Oct 20, 2025 · Updated 4 months ago
- ✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models ☆36 · Oct 1, 2025 · Updated 5 months ago
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data. ☆22 · Nov 10, 2024 · Updated last year
- ☆80 · Jul 11, 2022 · Updated 3 years ago
- Tool for the Automatic Analysis of Syntactic Sophistication and Complexity ☆31 · Nov 4, 2023 · Updated 2 years ago
- Convert PyTorch BERT weights to TensorFlow ☆22 · May 19, 2020 · Updated 5 years ago
- Natural Universal Trigger Search (NUTS) ☆21 · Apr 17, 2021 · Updated 4 years ago
- WebRED is a large and diverse manually annotated dataset for extracting relationships from a variety of text found on the World Wide Web. ☆22 · Mar 11, 2021 · Updated 4 years ago
- The repository contains the dataset and the code of the paper: Document-Level Text Simplification: Dataset, Metric and Model. ☆26 · Jun 2, 2023 · Updated 2 years ago
- German Parliamentary Corpus (GerParCor) ☆30 · Jan 14, 2026 · Updated last month
- ☆30 · Feb 3, 2026 · Updated last month
- Small python package to measure OCR quality and other related metrics. ☆27 · Feb 19, 2024 · Updated 2 years ago
- Multi-sense word embeddings from visual co-occurrences ☆25 · Sep 5, 2019 · Updated 6 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆28 · Oct 3, 2021 · Updated 4 years ago
- Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition" ☆32 · Jun 20, 2023 · Updated 2 years ago
- Source code for ACL 2022 paper "Self-contrastive Decorrelation for Sentence Embeddings". ☆26 · Mar 10, 2025 · Updated 11 months ago
- Code for paper Sentence-aware Contrastive Learning for Open-Domain Passage Retrieval, Accepted by ACL2022 Main Conference, Long Paper ☆30 · Mar 12, 2022 · Updated 3 years ago
- A sample Java gRPC client for the Salesforce Pub/Sub API ☆12 · Oct 9, 2024 · Updated last year
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation" ☆30 · Apr 2, 2022 · Updated 3 years ago
- Source code for "UniRE: A Unified Label Space for Entity Relation Extraction.", ACL2021. It is based on our NERE toolkit (https://github.… ☆122 · Apr 13, 2022 · Updated 3 years ago
- ☆37 · Sep 22, 2021 · Updated 4 years ago
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings" ☆297 · Oct 27, 2022 · Updated 3 years ago
- State-of-the-art paired encoder and decoder models (17M-1B params) ☆59 · Aug 6, 2025 · Updated 7 months ago
- Code for "Dynamic Contextualized Word Embeddings" ☆32 · Dec 30, 2021 · Updated 4 years ago
- [ICIP 2024] Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model ☆16 · May 23, 2025 · Updated 9 months ago
- A python package to automate downloads of Salesforce Weekly Data Exports ☆10 · Jan 26, 2021 · Updated 5 years ago