Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.
☆59Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for TranslateAlignRetrieve
Users that are interested in TranslateAlignRetrieve are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆80Jun 3, 2021Updated 5 years ago
- New dataset☆311Aug 31, 2021Updated 4 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆318May 28, 2020Updated 6 years ago
- Highly specialized crate to parse and use `google/sentencepiece` 's precompiled_charsmap in `tokenizers`☆23Jun 9, 2026Updated 3 weeks ago
- 🫠 check your data, before you wreck your model☆16Aug 11, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆14Jul 18, 2025Updated 11 months ago
- Parallel Universal Dependencies.☆15May 6, 2026Updated last month
- ☆33Aug 16, 2021Updated 4 years ago
- Pre-training BART in Flax on The Pile dataset☆22Jul 24, 2021Updated 4 years ago
- Exploring Domain-Driven Design + HEX in Go☆11Aug 7, 2020Updated 5 years ago
- ☆210Nov 12, 2021Updated 4 years ago
- Training chatbot models with reinforcement learning in ParlAI.☆17Dec 8, 2022Updated 3 years ago
- A starter kit for evaluating benchmarks on the 🤗 Hub☆17Apr 8, 2026Updated 2 months ago
- A verified version of the WebArena Benchmark☆43Mar 8, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Evaluation framework for open-domain question answering.☆20May 16, 2021Updated 5 years ago
- Official codebase for the ACL 2025 Findings paper: Optimized Text Embedding Models and Benchmarks for Amharic Passage Retrieval.☆21Jul 26, 2025Updated 11 months ago
- ☆24Feb 16, 2024Updated 2 years ago
- The pipeline for the OSCAR corpus☆178Nov 9, 2025Updated 7 months ago
- Token and Sentence Level Classification with Google's BERT (TensorFlow)☆10Jul 11, 2019Updated 6 years ago
- Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME)☆22Apr 11, 2020Updated 6 years ago
- A collection of Danish Transformers☆30Aug 27, 2021Updated 4 years ago
- A pre-commit hook for Pyrefly.☆27Jun 19, 2026Updated last week
- Placeholder repository☆15Mar 16, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP☆10Jun 26, 2021Updated 5 years ago
- French Machine Reading for Question Answering☆18Sep 21, 2022Updated 3 years ago
- The source for the astropy data repository (although the primary server is not on github)☆13Jun 2, 2026Updated 3 weeks ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Apr 15, 2024Updated 2 years ago
- CLIP (Contrastive Language–Image Pre-training) trained on Indonesian data☆19Dec 4, 2021Updated 4 years ago
- ☆10Oct 28, 2019Updated 6 years ago
- A Python wrapper for the bioRxiv API.☆11Aug 18, 2021Updated 4 years ago
- Lightweight piece tokenization library☆12Apr 15, 2024Updated 2 years ago
- ☆18Feb 2, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Inspired by the neural style algorithm in the computer vision field, we propose a high-level language model with the aim of adapting the …☆18Nov 20, 2022Updated 3 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 6 years ago
- Small utility to monitor fairseq training in tensorboard☆21Apr 28, 2019Updated 7 years ago
- CBench, Benchmarking System for Question Answering Over Knowledge Graphs Systems.☆12Sep 16, 2022Updated 3 years ago
- The better version of Ubuntu Dialogue Corpus☆16Feb 20, 2016Updated 10 years ago
- Generate reports for spaCy models.☆29May 27, 2022Updated 4 years ago
- The 14th Machine Translation Marathon 2019 in Edinburgh☆13Dec 8, 2022Updated 3 years ago