ltgoslo / NorQuAD
Norwegian question answering dataset
☆13Updated last year
Alternatives and similar repositories for NorQuAD:
Users that are interested in NorQuAD are comparing it to the libraries listed below
- Natural language understanding benchmarks for Norwegian☆14Updated last year
- Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.☆31Updated last year
- Dutch abusive language data☆11Updated last year
- ☆21Updated 3 weeks ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"☆31Updated last year
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Updated last year
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated 2 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- Code for SaGe subword tokenizer (EACL 2023)☆22Updated 2 months ago
- ☆22Updated 2 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆83Updated last week
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.☆23Updated 7 months ago
- Semantically Structured Sentence Embeddings☆66Updated 4 months ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆12Updated last year
- This repository hosts the code for a tokenizer of tweets.☆12Updated 6 years ago
- ☆26Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated last month
- A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark☆28Updated this week
- ☆14Updated 10 months ago
- Bayesian Assessment of Hypotheses☆24Updated last year
- ☆11Updated 7 months ago
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni…☆29Updated last year
- Norwegian Speech Transformer Models☆18Updated 3 months ago
- LTG-Bert☆29Updated last year
- Few-shot Learning with Auxiliary Data☆26Updated last year
- ☆19Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆46Updated last year
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- The official repository for the LREC 2022 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science …☆27Updated 2 years ago