juletx/self-translate

Do Multilingual Language Models Think Better in English?

☆42

Alternatives and similar repositories for self-translate

Users who are interested in self-translate are comparing it to the repositories listed below.

ikergarcia1996 / T-Projection
T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.
☆13 · Nov 21, 2023 · Updated 2 years ago
fyvo / WMT-Biomed-Test
☆13 · Aug 23, 2024 · Updated last year
hitz-zentroa / latxa
Latxa: An Open Language Model and Evaluation Suite for Basque
☆36 · Dec 15, 2025 · Updated 7 months ago
tatHi / maxmatch_dropout
☆10 · Sep 13, 2022 · Updated 3 years ago
zoranmedic / mdcr
Benchmark dataset for the evaluation of scientific article representations on the task of citation recommendation across various scientif…
☆12 · Oct 21, 2022 · Updated 3 years ago
ntunlp / Evaluation-of-ChatGPT
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets.
☆15 · Jul 10, 2023 · Updated 3 years ago
hitz-zentroa / lm-contamination
The LM Contamination Index is a manually created database of contamination evidence for LMs.
☆81 · Apr 11, 2024 · Updated 2 years ago
zwhe99 / LLM-MT-Eval
{DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}
☆14 · Jun 18, 2023 · Updated 3 years ago
hplt-project / OpusTrainer
Curriculum training
☆22 · Jun 25, 2025 · Updated last year
duyichao / NPDA-KNN-ST
Official implementation of EMNLP'2022 paper "Non-Parametric Domain Adaptation for End-to-End Speech Translation"
☆11 · Oct 26, 2022 · Updated 3 years ago
Tomiinek / Aargh
☆12 · Jan 2, 2024 · Updated 2 years ago
osainz59 / t5-encoder
An extension of the Transformers library to include a T5ForSequenceClassification class.
☆40 · Apr 17, 2023 · Updated 3 years ago
iesl / CSFCube
A Test Collection of Computer Science Papers for Faceted Query by Example
☆23 · Nov 28, 2021 · Updated 4 years ago
ZurichNLP / ContraDecode
The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…
☆38 · Aug 29, 2025 · Updated 10 months ago
RUCAIBox / MPOP
☆13 · Jun 16, 2021 · Updated 5 years ago
pchizhov / picky_bpe
A BPE modification that removes intermediate tokens during tokenizer training.
☆27 · Nov 25, 2024 · Updated last year
knowledgeable-embedding / knowledgeable-embedding
Knowledgeable Embedding: Injecting dynamically updatable entity knowledge into embeddings to enhance RAG
☆15 · Aug 31, 2025 · Updated 10 months ago
ltgoslo / factorizer
☆16 · May 14, 2024 · Updated 2 years ago
hplt-project / OpusCleaner
OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.
☆58 · Feb 3, 2026 · Updated 5 months ago
AndreG-P / thesis-template
A template primarily for PhD theses but also suitable for Bachelor's or Master's theses
☆11 · Nov 10, 2021 · Updated 4 years ago
copenlu / scientific-information-change
Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022
☆13 · Oct 20, 2022 · Updated 3 years ago
dykang / xslue
ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding" by Dongyeop Kang and Eduard Hovy
☆15 · Jul 19, 2021 · Updated 5 years ago
ufal / augpt
DSTC9 Submission
☆16 · Apr 12, 2021 · Updated 5 years ago
copenlu / cite-worth
Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"
☆14 · Sep 8, 2022 · Updated 3 years ago
kaistAI / KtrlF
[NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"
☆23 · Oct 11, 2024 · Updated last year
tekkamanendless / umactually
This repo contains the stats for the College Humor show "Um, Actually..." as well as a simple HTML page to view those stats. The page is…
☆11 · Jul 17, 2026 · Updated last week
mt-upc / ZeroSwot
Pushing the Limits of Zero-shot End-to-End Speech Translation
☆25 · Dec 12, 2024 · Updated last year
GrammaTech / functional-trees
Tree data structure supporting functional manipulation. Works closely with FSet.
☆17 · Updated this week
clp-research / clemcore
A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark
☆32 · Jul 1, 2026 · Updated 3 weeks ago
ahmetustun / hyperx
☆21 · Dec 5, 2022 · Updated 3 years ago
openlegaldata / legal-ner
Named entity recognition for the legal domain
☆43 · Jun 1, 2021 · Updated 5 years ago
epfl-dlab / pairformance
Tool to perform paired evaluation of automatic systems
☆13 · Oct 20, 2021 · Updated 4 years ago
hppRC / defsent
DefSent: Sentence Embeddings using Definition Sentences
☆23 · Aug 5, 2021 · Updated 4 years ago
nianlonggu / Local-Citation-Recommendation
Code for the ECIR 2022 paper "Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking"
☆25 · Jul 30, 2024 · Updated last year
ClimSocAna / tecb-de
German Text Embedding Clustering Benchmark
☆19 · Mar 15, 2024 · Updated 2 years ago
WenzhengZhang / Seq2seqCoref
Official implementation for the paper "Seq2seq is All You Need for Coreference Resolution"
☆16 · Dec 1, 2023 · Updated 2 years ago
emorynlp / seq2seq-corenlp
☆13 · Feb 7, 2023 · Updated 3 years ago
tylerachang / goldfish
Goldfish: Monolingual language models for 350 languages.
☆27 · Mar 4, 2026 · Updated 4 months ago
trusthlt / eacl24-german-legal-questions
Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24
☆16 · Mar 2, 2024 · Updated 2 years ago