Crosslingual Reasoning through Test-Time Scaling
☆20May 13, 2025Updated last year
Alternatives and similar repositories for crosslingual-test-time-scaling
Users that are interested in crosslingual-test-time-scaling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.☆20Oct 3, 2024Updated last year
- Can LLMs generate code-mixed sentences through zero-shot prompting?☆11Apr 18, 2023Updated 3 years ago
- Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024☆18Mar 25, 2025Updated last year
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆101Mar 16, 2026Updated 3 months ago
- Official Implementation of K-Paths: Reasoning over Graph Paths for Drug Repurposing and Drug Interaction Prediction.☆20Jul 8, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.☆58Apr 3, 2025Updated last year
- MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models. (EMNLP 2024 Findings)☆14Oct 3, 2024Updated last year
- ☆22Jul 16, 2024Updated last year
- NoMIRACL: A multilingual hallucination evaluation dataset to evaluate LLM robustness in RAG against first-stage retrieval errors on 18 la…☆27Nov 29, 2024Updated last year
- Jump to better conclusions: SCAN both left and right☆11Jan 24, 2019Updated 7 years ago
- The paper list of multilingual pre-trained models (Continual Updated).☆25Jun 18, 2024Updated 2 years ago
- ✍️ A browser add-on (Firefox, Chrome, Thunderbird) that allows you to autocorrect common text sequences and convert text characters to a …☆12May 19, 2026Updated last month
- R3: Robust Rubric-Agnostic Reward Models☆22Jul 12, 2025Updated 11 months ago
- Multi-Layer Key-Value sharing experiments on Pythia models☆34Jun 14, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- EANN(Pytorch)☆10Mar 12, 2022Updated 4 years ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆20Jun 26, 2025Updated 11 months ago
- ☆17Dec 6, 2023Updated 2 years ago
- Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.☆50Nov 8, 2024Updated last year
- Dataset Catalogue Homepage for Indonesian Languages☆12Feb 19, 2024Updated 2 years ago
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆66Oct 16, 2024Updated last year
- Multilingual RAG benchmark.☆11Nov 22, 2024Updated last year
- ☆14Sep 1, 2025Updated 9 months ago
- Code for paper ”Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability“☆15Jun 13, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.☆43Oct 31, 2025Updated 7 months ago
- Code for "Towards Robust k-Nearest-Neighbor Machine Translation" (EMNLP 2022)☆12Oct 18, 2022Updated 3 years ago
- NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented an…☆28Sep 27, 2024Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆74Mar 2, 2024Updated 2 years ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"☆26Jun 3, 2025Updated last year
- Automatic OCR of clipboard contents.☆14Aug 12, 2022Updated 3 years ago
- [EMNLP 2022] This is the code repo for our EMNLP‘22 paper "Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder"…☆13Oct 20, 2022Updated 3 years ago
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆20Jul 27, 2025Updated 10 months ago
- A weak supervision framework for (partial) labeling functions☆16Jul 15, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A collaborative project to collect datasets in Indonesian languages.☆283Jun 2, 2024Updated 2 years ago
- ☆11Jun 23, 2022Updated 3 years ago
- Towards Few-Shot Fact-Checking via Perplexity☆13Jun 11, 2021Updated 5 years ago
- A python module to process data for Frame Semantic Parsing☆23Nov 3, 2020Updated 5 years ago
- Cross-lingual Language Model (XLM) pretraining and Model-Agnostic Meta-Learning (MAML) for fast adaptation of deep networks☆20Mar 26, 2021Updated 5 years ago
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models☆20Oct 24, 2024Updated last year
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.☆16Oct 14, 2023Updated 2 years ago