QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning PaLM with only five examples per language. We use the synthetic data to finetune downstream QA models leading to improved accuracy in comparison to English-only and translation-based baselines.
โ34Aug 15, 2023Updated 2 years ago
Alternatives and similar repositories for QAmeleon
Users that are interested in QAmeleon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2025] ๐ Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignmentโ11Apr 6, 2025Updated last year
- A collection of utilities for handling IPA phones.โ27Sep 24, 2023Updated 2 years ago
- Ukrainian ELECTRA modelโ12Mar 11, 2023Updated 3 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learningโ30Jan 25, 2023Updated 3 years ago
- ๐ข Data Toolkit for Sailor Language Modelsโ96Feb 24, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer โข AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- JAX Scalify: end-to-end scaled arithmeticsโ18Oct 30, 2024Updated last year
- โ56Apr 18, 2026Updated last month
- The paper list of multilingual pre-trained models (Continual Updated).โ24Jun 18, 2024Updated last year
- A library for language transfer methods and algorithms.โ16Feb 6, 2026Updated 3 months ago
- Library of models for Protein Function prediction (part of the 18th top solution out of 1625 teams in CAFA5)โ20May 23, 2025Updated last year
- suffix array construction and searching algorithms for in-memory binary data.โ12Sep 10, 2022Updated 3 years ago
- From Hero to Zรฉroe: A Benchmark of Low-Level Adversarial Attacksโ15Feb 23, 2023Updated 3 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMsโ36Apr 29, 2026Updated 3 weeks ago
- โ21Nov 20, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A part-of-speech tagger with support for domain adaptation and external resources.โ24Oct 26, 2022Updated 3 years ago
- Submission archive for the MS MARCO passage ranking leaderboardโ13Apr 21, 2023Updated 3 years ago
- โ37Nov 14, 2025Updated 6 months ago
- Deep memory and sequence models in JAXโ28Apr 23, 2026Updated last month
- Non Metric Space ( Approximate ) Library in Rโ12Feb 2, 2023Updated 3 years ago
- Official repository of the paper MPMQA: Multimodal Question Answering on Product Manuals (AAAI 2023)โ19Nov 28, 2022Updated 3 years ago
- Library for experimenting with state-of-the-art evaluation metrics like UScoreโ12May 27, 2023Updated 2 years ago
- ๅบไบไธญๅฟๅบฆ็ไธญๆๅ ณ้ฎ็ญ่ฏญๆฝๅๅทฅๅ ทโ11Sep 2, 2022Updated 3 years ago
- EWoK dataset generation frameworkโ14May 14, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- โ12Jul 6, 2023Updated 2 years ago
- [NAACL 2024] TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table Decompositionโ17Jan 5, 2026Updated 4 months ago
- Data Structures with Python(AIX20001) ๊ฐ์ ์๋ฃ์คโ18Jun 14, 2024Updated last year
- โ12Dec 13, 2022Updated 3 years ago
- โ15Nov 20, 2025Updated 6 months ago
- โ11Jun 19, 2022Updated 3 years ago
- https://arxiv.org/abs/2404.10917โ14Mar 18, 2025Updated last year
- Few-shot Learning with Auxiliary Dataโ31Dec 8, 2023Updated 2 years ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.โ81Apr 11, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean โข AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradientsโ27Sep 10, 2024Updated last year
- KG data for ODAโ12May 14, 2026Updated last week
- โ11Jun 2, 2022Updated 3 years ago
- โ๏ธ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) modelsโ39May 2, 2026Updated 3 weeks ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.โ22Mar 23, 2026Updated 2 months ago
- A powerful text cleaner for Japanese web textsโ12Jan 20, 2024Updated 2 years ago
- โ18Jun 9, 2025Updated 11 months ago