ikergarcia1996 / T-Projection
T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.
☆12Updated last year
Alternatives and similar repositories for T-Projection:
Users that are interested in T-Projection are comparing it to the libraries listed below
- LTG-Bert☆32Updated last year
- ☆27Updated 2 months ago
- ☆13Updated last week
- ☆23Updated 3 months ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆24Updated last month
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 3 years ago
- A survey of corpora for Germanic low-resource languages and dialects☆25Updated 5 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Updated 2 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆19Updated 3 months ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Updated 3 months ago
- ☆19Updated 2 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Updated 2 years ago
- Multilingual Open Text☆25Updated 6 months ago
- GC4LM: A Colossal (Biased) language model for German☆13Updated 4 years ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆86Updated this week
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here.☆27Updated last year
- Dataset, models, and code for paper "CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation", …☆33Updated 2 years ago
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆16Updated 3 years ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Updated 2 years ago
- ParaNames: A multilingual resource for parallel names☆31Updated 11 months ago
- ☆11Updated 4 months ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Updated 3 years ago
- ☆13Updated 3 years ago
- Embedding Recycling for Language models☆38Updated last year
- Code for SaGe subword tokenizer (EACL 2023)☆24Updated 5 months ago
- ☆13Updated 3 years ago
- An official repository for MIA 2022 (NAACL 2022 Workshop) Shared Task on Cross-lingual Open-Retrieval Question Answering.☆30Updated 2 years ago
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge☆17Updated 3 years ago