cindyxinyiwang / multiview-subword-regularizationView external linksLinks
PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"
☆26Jun 2, 2021Updated 4 years ago
Alternatives and similar repositories for multiview-subword-regularization
Users that are interested in multiview-subword-regularization are comparing it to the libraries listed below
Sorting:
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆41May 5, 2021Updated 4 years ago
- Python library for backtranslation (with Google Translate)☆12Jan 11, 2020Updated 6 years ago
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- ☆18Nov 25, 2022Updated 3 years ago
- [Unofficial] Kakaotrans: Kakao translate API for python☆16Mar 29, 2020Updated 5 years ago
- “Data Augmentation for Cross-Domain Named Entity Recognition” (EMNLP 2021)☆20Apr 4, 2022Updated 3 years ago
- NER task for Naver NLP Challenge 2018 (3rd Place)☆18Mar 24, 2023Updated 2 years ago
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- pytorch implementation for "Mutual Information Neural Estimation"☆11Dec 13, 2019Updated 6 years ago
- ☆11Aug 26, 2021Updated 4 years ago
- Scale your ML workers asynchronously across processes and machines☆13Apr 1, 2025Updated 10 months ago
- Codes for NLPDove at SemEval 2020 Task 6: OffensEval, COLING 2020☆10Apr 3, 2020Updated 5 years ago
- ☆11Jun 23, 2022Updated 3 years ago
- MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models. (EMNLP 2024 Findings)☆14Oct 3, 2024Updated last year
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA☆16May 11, 2022Updated 3 years ago
- Getting interpretable dimensions in word embedding spaces.☆15Jul 6, 2023Updated 2 years ago
- PathPiece tokenizer☆13Nov 10, 2024Updated last year
- Enhaced version of Wikiextrator: A wikipedia dumps extractor☆28Sep 17, 2025Updated 4 months ago
- Code for Episodic Memory Reader (EMR) https://arxiv.org/abs/1903.06164☆15Nov 16, 2022Updated 3 years ago
- Streamlit, but better.☆16Feb 5, 2024Updated 2 years ago
- Curriculum training☆22Jun 25, 2025Updated 7 months ago
- 매주 목요일, 20:00 모임☆16Jul 24, 2020Updated 5 years ago
- Test code of Inverse cloze task for information retrieval☆33Jan 10, 2021Updated 5 years ago
- ☆62Apr 19, 2022Updated 3 years ago
- Python Template Repository☆19Updated this week
- Create augmentation examples from MultiNLI by subject-object inversion and passivizing.☆17Feb 22, 2021Updated 4 years ago
- ☆43Feb 21, 2022Updated 3 years ago
- Code for our EMNLP 2020 Paper "AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network"☆19Nov 14, 2022Updated 3 years ago
- Code and data for the paper "Soft Gazetteers for Low-resource Named Entity Recognition"☆19Nov 3, 2020Updated 5 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- This repository contains the HiNER dataset released with our paper at LREC 2022☆16Jun 6, 2023Updated 2 years ago
- ☆45Oct 11, 2021Updated 4 years ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability☆21Jul 16, 2022Updated 3 years ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆26Nov 25, 2024Updated last year
- ☆52Jun 6, 2023Updated 2 years ago
- ☆75Jul 2, 2021Updated 4 years ago
- Massively Multilingual Transfer for NER☆86Oct 7, 2021Updated 4 years ago
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"☆131Apr 23, 2022Updated 3 years ago
- Data for the ACL SRW 2020 paper "Understanding Points of Correspondence between Sentences for Abstractive Summarization"☆20Nov 2, 2022Updated 3 years ago