amazon-science / mintaka
Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)
☆118Oct 25, 2022Updated 3 years ago
Alternatives and similar repositories for mintaka
Users that are interested in mintaka are comparing it to the libraries listed below
- ☆14Apr 8, 2021Updated 4 years ago
- Learning Semantic Parsers from Denotations with Latent Structured Alignments and Abstract Programs (EMNLP 2019)☆19Dec 3, 2019Updated 6 years ago
- paraphrase sentence☆11Aug 22, 2025Updated 5 months ago
- ☆25Jan 22, 2024Updated 2 years ago
- Directed masked autoencoders☆14Feb 5, 2026Updated last week
- An extension of the Transformers library to include a T5ForSequenceClassification class.☆40Apr 17, 2023Updated 2 years ago
- ☆13Sep 2, 2021Updated 4 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- An NLP research project mainly exploring sequence-to-sequence (s2s) architecture to build an Indonesian Automatic Question Generator (AQG). You can …☆25Dec 8, 2022Updated 3 years ago
- Dockerized code for E3: Entailment-driven Extracting and Editing for Conversational Machine Reading.☆48Jul 22, 2023Updated 2 years ago
- QALD-9-Plus Dataset for Knowledge Graph Question Answering☆29Jun 5, 2024Updated last year
- AWD-LSTM language model trained on newspaper corpora with fast.ai☆27Apr 9, 2020Updated 5 years ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆13Nov 21, 2023Updated 2 years ago
- ☆13Nov 30, 2022Updated 3 years ago
- g2p ID: Indonesian Grapheme-to-Phoneme Converter☆27Dec 13, 2024Updated last year
- ☆19Sep 16, 2025Updated 4 months ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Dec 14, 2021Updated 4 years ago
- NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021☆13May 18, 2021Updated 4 years ago
- PathPiece tokenizer☆13Nov 10, 2024Updated last year
- Textual Visual Semantic Dataset for Text Spotting. CVPRW 2020☆12Jul 2, 2022Updated 3 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Jun 9, 2021Updated 4 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Feb 21, 2019Updated 6 years ago
- This repo contains a set of notebooks to reproduce reinforcement learning algorithms.☆16Nov 21, 2022Updated 3 years ago
- ☆15Jul 8, 2023Updated 2 years ago
- Don't Count, Predict! An Automatic Approach to Learning Sentiment Lexicons for Short Text☆13Jul 20, 2016Updated 9 years ago
- Software for wikification of Japanese text☆17Mar 14, 2017Updated 8 years ago
- ☆15Mar 22, 2023Updated 2 years ago
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆37May 3, 2024Updated last year
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- ☆19Sep 19, 2022Updated 3 years ago
- ☆16Oct 11, 2021Updated 4 years ago
- ☆38Apr 17, 2024Updated last year
- Extracts plain text, language identification and more metadata from WARC records☆23Oct 1, 2025Updated 4 months ago
- ☆16Apr 9, 2021Updated 4 years ago
- Benchmarking Multidomain English-Indonesian Machine Translation☆16Dec 19, 2020Updated 5 years ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Apr 1, 2025Updated 10 months ago
- ☆225Sep 19, 2023Updated 2 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Dec 27, 2022Updated 3 years ago
- Tools and Modeling Code for the MASSIVE dataset☆554Nov 28, 2022Updated 3 years ago