MFAQ: a Multilingual FAQ Dataset
☆18Sep 17, 2023Updated 2 years ago
Alternatives and similar repositories for mfaq
Users that are interested in mfaq are comparing it to the libraries listed below.
- NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021☆13May 18, 2021Updated 5 years ago
- Create augmentation examples from MultiNLI by subject-object inversion and passivizing.☆17Feb 22, 2021Updated 5 years ago
- A web interface to understand language-specific BERT-models☆18Apr 16, 2024Updated 2 years ago
- Extracting six domain-specific QA datasets from MS MARCO☆17Dec 1, 2019Updated 6 years ago
- Pre-training BART in Flax on The Pile dataset☆22Jul 24, 2021Updated 4 years ago
- Code for reproducing meta-learning for cross-lingual transfer learning in NLU and QA☆13Aug 17, 2021Updated 4 years ago
- Tools relating to the CC-News-En Collection☆20Dec 8, 2023Updated 2 years ago
- Author implementation of the paper "Don’t paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing"☆20Oct 5, 2020Updated 5 years ago
- ☆22Oct 14, 2021Updated 4 years ago
- Android custom keyboard implementation in kotlin☆13Oct 5, 2017Updated 8 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…☆49Apr 25, 2022Updated 4 years ago
- Python + OpenCV script to detect playing cards in an image. It uses template matching.☆13Jan 24, 2017Updated 9 years ago
- Adds noise to Korean documents.☆27Nov 9, 2022Updated 3 years ago
- This dataset contains human judgements about answer equivalence. The data is based on SQuAD (Stanford Question Answering Dataset), and co…☆28Oct 24, 2022Updated 3 years ago
- Question-Directed Graph Attention Network for Numerical Reasoning over Text☆10Aug 14, 2020Updated 5 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- ☆12Apr 15, 2022Updated 4 years ago
- ☆62Apr 19, 2022Updated 4 years ago
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)☆36Aug 31, 2021Updated 4 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- Re-Implementation of SPARTA model☆13Oct 1, 2021Updated 4 years ago
- ☆12Sep 2, 2021Updated 4 years ago
- Compressed sparse matrices☆15Jun 12, 2024Updated last year
- This will help you convert a GPT2-XL model to an optimized onnx model fp 16.☆10Oct 13, 2020Updated 5 years ago
- Shared code for training sentence embeddings with Flax / JAX☆28Jul 15, 2021Updated 4 years ago
- 😎 Better Naver blog browsing☆10Jan 8, 2026Updated 5 months ago
- Code for Paper: Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data☆36Nov 16, 2020Updated 5 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Code and data to support the paper "PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them"☆211Aug 31, 2021Updated 4 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- ☆16Feb 10, 2026Updated 3 months ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆45Aug 10, 2024Updated last year
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."☆36Mar 2, 2023Updated 3 years ago
- A simple neural truecaser written in pytorch and allennlp.☆35Jun 17, 2024Updated last year
- CBench, Benchmarking System for Question Answering Over Knowledge Graphs Systems.☆12Sep 16, 2022Updated 3 years ago
- ☆14May 7, 2016Updated 10 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Nov 7, 2021Updated 4 years ago
- Datasets used in the tox21 challenge☆11Nov 6, 2019Updated 6 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆174Jun 6, 2021Updated 5 years ago