A curated list of resources for Cross-lingual Information Retrieval (CLIR).
☆49 · Jan 18, 2019 · Updated 7 years ago
Alternatives and similar repositories for awesome-clir
Users that are interested in awesome-clir are comparing it to the libraries listed below.
- first commit ☆18 · Apr 14, 2025 · Updated last year
- "Cross-lingual Language Model Pretraining for Retrieval". (WWW 2021) ☆10 · Jun 17, 2022 · Updated 4 years ago
- ☆18 · Jul 23, 2021 · Updated 4 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆19 · May 23, 2025 · Updated last year
- Uses gpt-2 to find all completions of a sentence over a certain probability threshold. ☆13 · Mar 17, 2020 · Updated 6 years ago
- ☆28 · Sep 28, 2021 · Updated 4 years ago
- Witwicky: An implementation of Transformer in PyTorch. ☆22 · Aug 17, 2020 · Updated 5 years ago
- ☆20 · Dec 31, 2020 · Updated 5 years ago
- Code and resources for evaluating cross-lingual embedding spaces ☆29 · Apr 7, 2020 · Updated 6 years ago
- Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking ☆17 · Oct 26, 2023 · Updated 2 years ago
- ☆14 · Jan 6, 2017 · Updated 9 years ago
- A collection of product search embedding models ☆19 · Jan 17, 2020 · Updated 6 years ago
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive… ☆22 · Apr 13, 2023 · Updated 3 years ago
- Repository containing the website for the EMNLP 2023 conference ☆17 · Feb 12, 2025 · Updated last year
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation" ☆40 · Jan 2, 2019 · Updated 7 years ago
- Dataset and baseline for ACL 2019 paper "XQA: A Cross-lingual Open-domain Question Answering Dataset" ☆89 · Nov 16, 2021 · Updated 4 years ago
- Code for the paper "Cross-Lingual BERT Transformation for Zero-Shot Dependency Parsing" ☆35 · Sep 20, 2019 · Updated 6 years ago
- Longformer for MS MARCO document re-ranking task. ☆20 · Jan 11, 2021 · Updated 5 years ago
- Example code for "밑바닥부터 시작하는 데이터 사이언스" ("Data Science from Scratch") ☆14 · Feb 5, 2020 · Updated 6 years ago
- ☆30 · Oct 8, 2018 · Updated 7 years ago
- Code for extracting parallel corpora from pmindia ☆17 · Jan 28, 2020 · Updated 6 years ago
- Tooling to play around with multilingual machine translation for Indian Languages. ☆22 · Mar 5, 2022 · Updated 4 years ago
- ☆36 · Jun 12, 2023 · Updated 3 years ago
- provides a common interface to many IR measure tools ☆101 · Feb 17, 2026 · Updated 4 months ago
- This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets … ☆15 · Aug 31, 2021 · Updated 4 years ago
- ☆13 · Sep 27, 2022 · Updated 3 years ago
- ☆22 · Dec 20, 2019 · Updated 6 years ago
- ☆26 · Jan 9, 2023 · Updated 3 years ago
- A framework to learn cross-lingual word embedding mappings ☆654 · Apr 22, 2023 · Updated 3 years ago
- A Python wrapper for the ROUGE summarization evaluation package ☆14 · Aug 9, 2017 · Updated 8 years ago
- Pre-processing and in some cases downloading of datasets for the paper "Content Selection in Deep Learning Models of Summarization." ☆78 · Nov 2, 2022 · Updated 3 years ago
- Code that drives the public web-based tools for the Media Cloud Online News Archive and Directory. ☆12 · Jun 11, 2026 · Updated last week
- ☆11 · Jul 17, 2020 · Updated 5 years ago
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021) ☆13 · Nov 21, 2021 · Updated 4 years ago
- Generative Reranker PyTerrier ☆18 · Dec 1, 2025 · Updated 6 months ago
- Metadata browser of TREC ☆10 · May 19, 2026 · Updated last month
- Diagnostic tests for linguistic capacities in language models ☆65 · May 7, 2022 · Updated 4 years ago
- ☆26 · Sep 14, 2025 · Updated 9 months ago
- A command-line tool for creating and managing external HITs on Amazon's Mechanical Turk ☆15 · Jan 11, 2021 · Updated 5 years ago