Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.
☆59Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for TranslateAlignRetrieve
Users that are interested in TranslateAlignRetrieve are comparing it to the libraries listed below
Sorting:
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆80Jun 3, 2021Updated 4 years ago
- Common Voice Generator using Speech Synthesizer☆13Jul 28, 2021Updated 4 years ago
- ☆33Aug 16, 2021Updated 4 years ago
- Parallel Universal Dependencies.☆15Nov 12, 2025Updated 3 months ago
- 🫠 check your data, before you wreck your model☆16Aug 11, 2022Updated 3 years ago
- New dataset☆311Aug 31, 2021Updated 4 years ago
- Highly specialized crate to parse and use `google/sentencepiece` 's precompiled_charsmap in `tokenizers`☆20Jan 8, 2026Updated last month
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)☆76Apr 10, 2023Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Mar 21, 2021Updated 4 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆40Jan 2, 2019Updated 7 years ago
- CLIP (Contrastive Language–Image Pre-training) trained on Indonesian data☆19Dec 4, 2021Updated 4 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆317May 28, 2020Updated 5 years ago
- Agents that build knowledge graphs and explore textual worlds by asking questions☆79Aug 9, 2023Updated 2 years ago
- ☆20Apr 5, 2021Updated 4 years ago
- Evaluation framework for open-domain question answering.☆20May 16, 2021Updated 4 years ago
- Welcome to our repository! This repository hosts the data on "IndoCollex: A Testbed for Morphological Transformation of Indonesian Word …☆23Aug 10, 2021Updated 4 years ago
- NAACL'19: "Jointly Optimizing Diversity and Relevance in Neural Response Generation"☆73Sep 28, 2020Updated 5 years ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆41May 5, 2021Updated 4 years ago
- HEAD-QA: A Healthcare Dataset for Complex Reasoning☆33Feb 15, 2021Updated 5 years ago
- ☆207Nov 12, 2021Updated 4 years ago
- Question-answers, collected from Google☆132Jul 23, 2021Updated 4 years ago
- ☆12Dec 20, 2018Updated 7 years ago
- Generate reports for spaCy models.☆29May 27, 2022Updated 3 years ago
- This repository is about how to build an SQLite version of the Arabic WordNet database.☆10Mar 19, 2019Updated 6 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Apr 11, 2023Updated 2 years ago
- XPersona: Evaluating Multilingual Personalized Chatbot☆70Apr 6, 2023Updated 2 years ago
- Experiments with generating opensource language model assistants☆97May 14, 2023Updated 2 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆445May 9, 2022Updated 3 years ago
- [NAACL 2021] Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering☆36Apr 20, 2021Updated 4 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Apr 21, 2021Updated 4 years ago
- A pre-commit hook for Pyrefly.☆23Updated this week
- 🧾 Let's automate Invoice generation from CSV file (@jakobowsky YouTube tutorial)☆12Sep 12, 2020Updated 5 years ago
- RNN that generates names resembling those you give it, be it people's names, city names, etc.☆13Jan 22, 2019Updated 7 years ago
- ☆11Jun 15, 2019Updated 6 years ago
- This repository defines a python class that can be used to load data for the tf.keras.model.fit_generator function by using a torch.utils…☆11Oct 26, 2024Updated last year
- Stemmer and lemmatizer for Indonesian (Bahasa Indonesia)☆42Aug 14, 2023Updated 2 years ago
- Official code and data repository for our ACL 2019 long paper "Generating Question-Answer Hierarchies" (https://arxiv.org/abs/1906.02622)…☆92Aug 10, 2024Updated last year
- ☆44Sep 16, 2020Updated 5 years ago
- ☆17Nov 7, 2023Updated 2 years ago