Do Multilingual Language Models Think Better in English?
☆42Aug 3, 2023Updated 2 years ago
Alternatives and similar repositories for self-translate
Users that are interested in self-translate are comparing it to the libraries listed below
- ☆10Sep 13, 2022Updated 3 years ago
- Benchmark dataset for the evaluation of scientific article representations on the task of citation recommendation across various scientif…☆12Oct 21, 2022Updated 3 years ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆13Nov 21, 2023Updated 2 years ago
- Curriculum training☆22Jun 25, 2025Updated 8 months ago
- A Test Collection of Computer Science Papers for Faceted Query by Example☆22Nov 28, 2021Updated 4 years ago
- ☆12Jan 2, 2024Updated 2 years ago
- A template primarily for PhD theses but also suitable for Bachelor's or Master's theses☆11Nov 10, 2021Updated 4 years ago
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24☆14Mar 2, 2024Updated 2 years ago
- ☆13Jun 16, 2021Updated 4 years ago
- Code for ECIR 2022 paper Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking☆25Jul 30, 2024Updated last year
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Sep 8, 2022Updated 3 years ago
- ☆13Feb 7, 2023Updated 3 years ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆36Aug 29, 2025Updated 6 months ago
- A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark☆32Feb 20, 2026Updated last week
- German Text Embedding Clustering Benchmark☆18Mar 15, 2024Updated last year
- ☆16May 14, 2024Updated last year
- Tool to perform paired evaluation of automatic systems☆13Oct 20, 2021Updated 4 years ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 2 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper☆14Aug 9, 2021Updated 4 years ago
- A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets.☆15Jul 10, 2023Updated 2 years ago
- DSTC9 Submission☆16Apr 12, 2021Updated 4 years ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Nov 15, 2023Updated 2 years ago
- Official implementation of the paper "Seq2seq is All You Need for Coreference Resolution"☆16Dec 1, 2023Updated 2 years ago
- Named entity recognition for the legal domain☆43Jun 1, 2021Updated 4 years ago
- ☆19Jul 22, 2019Updated 6 years ago
- DefSent: Sentence Embeddings using Definition Sentences☆22Aug 5, 2021Updated 4 years ago
- An extension of the Transformers library that adds a T5ForSequenceClassification class.☆40Apr 17, 2023Updated 2 years ago
- Converting PDF files to text, mainly with a focus on arXiv papers.☆24Feb 19, 2024Updated 2 years ago
- ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding" by Dongyeop Kang and Eduard Hovy☆15Jul 19, 2021Updated 4 years ago
- ☆17Oct 5, 2020Updated 5 years ago
- The LM Contamination Index is a manually created database of contamination evidence for LMs.☆82Apr 11, 2024Updated last year
- EMNLP2022 "Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment"☆19Feb 19, 2023Updated 3 years ago
- All source URLs of the 1,000 songs for creating melody-lyric alignment data.☆16Aug 15, 2019Updated 6 years ago
- A BPE modification that removes intermediate tokens during tokenizer training.☆26Nov 25, 2024Updated last year
- LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation☆23Apr 24, 2024Updated last year
- ☆20Jul 24, 2024Updated last year
- ☆25Jan 22, 2024Updated 2 years ago
- Software for transferring pre-trained English models to foreign languages☆19Mar 20, 2023Updated 2 years ago
- Repository for JSICK☆45May 31, 2023Updated 2 years ago