Do Multilingual Language Models Think Better in English?
☆42Aug 3, 2023Updated 2 years ago
Alternatives and similar repositories for self-translate
Users who are interested in self-translate are comparing it to the libraries listed below.
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆13Nov 21, 2023Updated 2 years ago
- ☆13Aug 23, 2024Updated last year
- ☆10Sep 13, 2022Updated 3 years ago
- Benchmark dataset for the evaluation of scientific article representations on the task of citation recommendation across various scientif…☆12Oct 21, 2022Updated 3 years ago
- A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets.☆15Jul 10, 2023Updated 2 years ago
- Curriculum training☆22Jun 25, 2025Updated 10 months ago
- The LM Contamination Index is a manually created database of contamination evidence for LMs.☆82Apr 11, 2024Updated 2 years ago
- ☆21Dec 5, 2022Updated 3 years ago
- ☆12Jan 2, 2024Updated 2 years ago
- An extension of the Transformers library that adds a T5ForSequenceClassification class.☆40Apr 17, 2023Updated 3 years ago
- A Test Collection of Computer Science Papers for Faceted Query by Example☆23Nov 28, 2021Updated 4 years ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆38Aug 29, 2025Updated 8 months ago
- BPE modification that removes intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- ☆16May 14, 2024Updated last year
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆58Feb 3, 2026Updated 3 months ago
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆23Oct 11, 2024Updated last year
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24☆14Mar 2, 2024Updated 2 years ago
- DSTC9 Submission☆16Apr 12, 2021Updated 5 years ago
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022☆13Oct 20, 2022Updated 3 years ago
- ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding" by Dongyeop Kang and Eduard Hovy☆15Jul 19, 2021Updated 4 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Dec 12, 2024Updated last year
- Tree data structure supporting functional manipulation. Works closely with FSet.☆17Mar 16, 2026Updated last month
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Nov 15, 2023Updated 2 years ago
- Tool to perform paired evaluation of automatic systems☆13Oct 20, 2021Updated 4 years ago
- Named entity recognition for the legal domain☆43Jun 1, 2021Updated 4 years ago
- German Text Embedding Clustering Benchmark☆18Mar 15, 2024Updated 2 years ago
- Code for ECIR 2022 paper Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking☆25Jul 30, 2024Updated last year
- ☆13Feb 7, 2023Updated 3 years ago
- DefSent: Sentence Embeddings using Definition Sentences☆23Aug 5, 2021Updated 4 years ago
- Official Implementation for Seq2seq is All You Need For Coreference Resolution Paper☆16Dec 1, 2023Updated 2 years ago
- Are foundation LMs multilingual knowledge bases? (EMNLP 2023)☆19Dec 8, 2023Updated 2 years ago
- This repo contains the stats for the College Humor show "Um, Actually..." as well as a simple HTML page to view those stats. The page is…☆11Apr 17, 2026Updated 2 weeks ago
- ☆22Sep 19, 2023Updated 2 years ago
- Materials for paper "Are Large Language Models Temporally Grounded?"☆13Nov 16, 2023Updated 2 years ago
- ☆25Jan 22, 2024Updated 2 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper☆14Aug 9, 2021Updated 4 years ago
- ZS4IE: A Toolkit for Zero-Shot Information Extraction with Simple Verbalizations☆29Mar 28, 2022Updated 4 years ago
- Software for transferring pre-trained English models to foreign languages☆19Mar 20, 2023Updated 3 years ago
- Converting PDF files to text, mainly with a focus on arXiv papers.☆24Feb 19, 2024Updated 2 years ago