ltgoslo / NorQuAD
Norwegian question answering dataset
☆13Updated 11 months ago
Alternatives and similar repositories for NorQuAD:
Users that are interested in NorQuAD are comparing it to the libraries listed below
- Natural language understanding benchmarks for Norwegian☆14Updated last year
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.☆22Updated 6 months ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆11Updated last year
- Evaluation of language models on mono- or multilingual tasks.☆76Updated this week
- Semantically Structured Sentence Embeddings☆66Updated 3 months ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆25Updated 4 months ago
- ☆21Updated last year
- ConfliBERT: A Pre-trained Language Model for Political Conflict and Violence (NAACL 2022)☆25Updated this week
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆45Updated last year
- Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.☆30Updated last year
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated last year
- Package to extract connotation frames☆81Updated last year
- ☆22Updated 2 years ago
- Bayesian Assessment of Hypotheses☆24Updated last year
- ☆11Updated 6 months ago
- Noise-robust de-duplication at scale☆15Updated last year
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆22Updated 2 weeks ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆83Updated last week
- Code for SaGe subword tokenizer (EACL 2023)☆22Updated last month
- Dutch abusive language data☆11Updated last year
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆22Updated 3 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆26Updated 3 years ago
- Temporary remove unused tokens during training to save ram and speed.☆22Updated 6 months ago
- triple-encoders is a library for contextualizing distributed Sentence Transformers representations.☆13Updated 4 months ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆47Updated 2 years ago
- ☆21Updated last year
- Robust Cross-lingual Embeddings from Parallel Sentences☆21Updated 4 years ago
- KIND: an Italian Multi-Domain Dataset for Named Entity Recognition☆15Updated last year
- ☆26Updated 5 months ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated last week