ccasimiro88/TranslateAlignRetrieve

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ccasimiro88/TranslateAlignRetrieve)

ccasimiro88 / TranslateAlignRetrieve

Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.

☆59

Alternatives and similar repositories for TranslateAlignRetrieve

Users that are interested in TranslateAlignRetrieve are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

semantic-systems / amharic-qa
View on GitHub
AmQA - The first Amharic Open Domain Question Answering Dataset
☆15May 27, 2024Updated 2 years ago
AkariAsai / XORQA
View on GitHub
This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".
☆80Jun 3, 2021Updated 5 years ago
AkariAsai / extractive_rc_by_runtime_mt
View on GitHub
Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"
☆40Jan 2, 2019Updated 7 years ago
facebookresearch / MLQA
View on GitHub
New dataset
☆312Aug 31, 2021Updated 4 years ago
jogonba2 / twilbert
View on GitHub
Specialization of BERT architecture both for the Spanish language and the Twitter domain
☆13Nov 6, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
google-research-datasets / tydiqa
View on GitHub
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …
☆319May 28, 2020Updated 6 years ago
cahya-wirawan / artificial-commonvoice
View on GitHub
Common Voice Generator using Speech Synthesizer
☆14Jul 28, 2021Updated 4 years ago
coqui-ai / data-checker
View on GitHub
🫠 check your data, before you wreck your model
☆16Aug 11, 2022Updated 3 years ago
UniversalDependencies / UD_Thai-PUD
View on GitHub
Parallel Universal Dependencies.
☆15May 6, 2026Updated 2 months ago
JerichoWorld / JerichoWorld
View on GitHub
☆33Aug 16, 2021Updated 4 years ago
amirveyseh / AAAI-21-SDU-shared-task-2-AD
View on GitHub
☆21Nov 20, 2020Updated 5 years ago
dmis-lab / GeNER
View on GitHub
Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)
☆75Apr 10, 2023Updated 3 years ago
morganmcg1 / rotobart
View on GitHub
Pre-training BART in Flax on The Pile dataset
☆22Jul 24, 2021Updated 4 years ago
google-deepmind / xquad
View on GitHub
☆210Nov 12, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
nisargjhaveri / indicNLP
View on GitHub
A collection of basic text processing modules focused on Gujarati
☆10Oct 24, 2017Updated 8 years ago
harshel / AUDIO-PREOCESSING-AND-SPEECH-CLASSIFICATION
View on GitHub
This repository consists of the IPython Notebook for the work related to audio processing and implementing convolution neural networks fo…
☆13Feb 13, 2019Updated 7 years ago
soco-ai / SF-QA
View on GitHub
Evaluation framework for open-domain question answering.
☆20May 16, 2021Updated 5 years ago
freds0 / katube
View on GitHub
KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…
☆26Jul 27, 2024Updated last year
alvenirai / punctfix
View on GitHub
☆24Feb 16, 2024Updated 2 years ago
26hzhang / bert_classification
View on GitHub
Token and Sentence Level Classification with Google's BERT (TensorFlow)
☆10Jul 11, 2019Updated 7 years ago
Tikam02 / devops-interview-questions
View on GitHub
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP
☆10Jun 26, 2021Updated 5 years ago
henchc / web-scrapers
View on GitHub
various web scrapers as examples
☆17Oct 10, 2020Updated 5 years ago
JunjieHu / xtreme-dev
View on GitHub
Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME)
☆22Apr 11, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ocastel / exact-extract
View on GitHub
☆12Sep 2, 2021Updated 4 years ago
Alikabbadj / French-SQuAD
View on GitHub
French Machine Reading for Question Answering
☆18Sep 21, 2022Updated 3 years ago
ryanzhumich / sparc_atis_pytorch
View on GitHub
☆10Oct 28, 2019Updated 6 years ago
jzbjyb / lm-calibration
View on GitHub
☆34Nov 17, 2021Updated 4 years ago
xkianteb / leaqi
View on GitHub
Active Imitation Learing with Noisy Guidance
☆10May 29, 2020Updated 6 years ago
galuhsahid / clip-indonesian
View on GitHub
CLIP (Contrastive Language–Image Pre-training) trained on Indonesian data
☆19Dec 4, 2021Updated 4 years ago
allenai / neural-wire-viz
View on GitHub
Javascript library for visualizing dynamic neural networks across time.
☆13Dec 9, 2019Updated 6 years ago
alexandrainst / alexandra-ml-template
View on GitHub
Template for Python-based data science projects in the Alexandra Institute.
☆12Jun 10, 2026Updated last month
jacquerie / biorxiv-cli
View on GitHub
A Python wrapper for the bioRxiv API.
☆11Aug 18, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
noe / fairseq-tensorboard
View on GitHub
Small utility to monitor fairseq training in tensorboard
☆21Apr 28, 2019Updated 7 years ago
talonvoice / wav2train
View on GitHub
automatically align transcribed audio and generate a wav2letter training corpus
☆36Apr 11, 2023Updated 3 years ago
Prem-kumar27 / Fast-KTSpeechCrawler
View on GitHub
Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler
☆23Mar 21, 2021Updated 5 years ago
google-research / dialog-inpainting
View on GitHub
☆97Aug 6, 2022Updated 3 years ago
sridharmahadevan / Geodesic-Covariance-Alignment
View on GitHub
MATLAB code for ECML 2018 paper on "Unified Framework for Domain Adaptation using Metric Learning on Manifolds"
☆10Oct 14, 2018Updated 7 years ago
Rallio67 / language-model-agents
View on GitHub
Experiments with generating opensource language model assistants
☆97May 14, 2023Updated 3 years ago
oscar-project / goclassy
View on GitHub
An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.
☆86Apr 21, 2021Updated 5 years ago