jjzha / cartography-alView external linksLinks
Code base for the EMNLP 2021 Findings paper: Cartography Active Learning
☆14Jun 3, 2025Updated 8 months ago
Alternatives and similar repositories for cartography-al
Users that are interested in cartography-al are comparing it to the libraries listed below
Sorting:
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆27Updated this week
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated 10 months ago
- [Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fi…☆12Sep 17, 2024Updated last year
- ☆11Jun 23, 2022Updated 3 years ago
- ☆12Dec 6, 2024Updated last year
- ☆13Feb 7, 2023Updated 3 years ago
- ☆16Dec 14, 2022Updated 3 years ago
- ☆16May 14, 2024Updated last year
- AFD Dataset Cleaned☆15Apr 9, 2020Updated 5 years ago
- Repository for the paper titled: "When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer"☆13Nov 10, 2021Updated 4 years ago
- Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022☆14Nov 7, 2022Updated 3 years ago
- Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics☆216Jul 19, 2022Updated 3 years ago
- [ICLR 2022] Code for paper "Exploring Extreme Parameter Compression for Pre-trained Language Models"(https://arxiv.org/abs/2205.10036)☆22May 24, 2023Updated 2 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆24May 12, 2024Updated last year
- CrossRE: A Cross-Domain Dataset for Relation Extraction (Findings of EMNLP 2022)☆49Aug 20, 2024Updated last year
- ☆21Oct 19, 2020Updated 5 years ago
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆26May 1, 2022Updated 3 years ago
- Robust Self-augmentation for NER with Meta-reweighting☆29Nov 8, 2022Updated 3 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- The repository contains code for Adaptive Data Optimization☆32Dec 9, 2024Updated last year
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Apr 2, 2022Updated 3 years ago
- A library of techniques for local interpretation of machine learning models☆10Mar 24, 2023Updated 2 years ago
- ☆77Apr 29, 2024Updated last year
- A Data-Driven Approach to Predict the Success of Bank Telemarketing☆10Apr 27, 2021Updated 4 years ago
- Code for pre-training CharacterBERT models (as well as BERT models).☆34Sep 6, 2021Updated 4 years ago
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- ☆10Jul 16, 2023Updated 2 years ago
- A collection of demos and utilities prepared ahead of the Vector Institute Privacy Enhancing Techniques (PETs) Bootcamp.☆15Sep 22, 2022Updated 3 years ago
- Dataset Catalogue Homepage for Indonesian Languages☆10Feb 19, 2024Updated last year
- ☆10Oct 2, 2024Updated last year
- XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation…☆10Nov 4, 2022Updated 3 years ago
- Guide to interviewing for industry machine learning roles (data/applied/research scientist, ML engineer, etc).☆11Dec 28, 2022Updated 3 years ago
- Compute training dynamics, plot data cartography, analysing data quality...☆42Nov 10, 2022Updated 3 years ago
- ☆45Jul 5, 2022Updated 3 years ago
- Digitale Geisteswissenschaften rund um Graphentechnologien☆10Updated this week
- ☆12Dec 4, 2020Updated 5 years ago
- Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices☆10Aug 3, 2020Updated 5 years ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Nov 22, 2022Updated 3 years ago