An accurate multilingual word aligner based on LaBSE
☆24Oct 25, 2023Updated 2 years ago
Alternatives and similar repositories for AccAlign
Users that are interested in AccAlign are comparing it to the libraries listed below.
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Feb 12, 2023Updated 3 years ago
- Code for paper "Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability"☆15Jun 13, 2023Updated 2 years ago
- NanGe - A Rule-based Chinese-English Machine Translation System☆20Jul 23, 2017Updated 8 years ago
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021☆61May 10, 2021Updated 4 years ago
- ☆13Apr 13, 2021Updated 4 years ago
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP☆14Mar 24, 2021Updated 5 years ago
- A different, but useful, textcat approach.☆18Jul 15, 2024Updated last year
- ☆10Oct 17, 2021Updated 4 years ago
- ☆37Nov 14, 2025Updated 4 months ago
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated 11 months ago
- Library for experimenting with state-of-the-art evaluation metrics like UScore☆12May 27, 2023Updated 2 years ago
- A centrality-based Chinese keyphrase extraction tool☆11Sep 2, 2022Updated 3 years ago
- NOAH's Corpus: Part-of-Speech Tagging for Swiss German☆12Jan 6, 2023Updated 3 years ago
- Interactive parametric benchmarks in Python☆17Apr 18, 2021Updated 4 years ago
- EWoK dataset generation framework☆10May 14, 2024Updated last year
- ☆12Dec 13, 2022Updated 3 years ago
- ☆15Nov 20, 2025Updated 4 months ago
- Reagent interface to the Mafs interactive 2d math visualization library.☆15Jun 1, 2024Updated last year
- KG data for ODA☆12Sep 21, 2024Updated last year
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Python port for IWNLP.Lemmatizer☆18Oct 18, 2023Updated 2 years ago
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- A powerful text cleaner for Japanese web texts☆12Jan 20, 2024Updated 2 years ago
- ☆19Jun 9, 2025Updated 9 months ago
- Can LLMs generate code-mixed sentences through zero-shot prompting?☆11Apr 18, 2023Updated 2 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 5 years ago
- TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes☆14Jul 1, 2025Updated 8 months ago
- todd is an interactive console TODO-list manager with VI key bindings for `todo.txt`☆21Mar 25, 2021Updated 4 years ago
- ☆16Jul 17, 2025Updated 8 months ago
- Docker image for Cloudflare workerd☆15Feb 11, 2023Updated 3 years ago
- Implements Global Word Vectors.☆11Feb 8, 2020Updated 6 years ago
- Experimental extension of next.jdbc to work with XTDB 2.0 (snapshots)☆12Jul 22, 2024Updated last year
- Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear project…☆18Jun 1, 2021Updated 4 years ago
- A library for evaluation of Grammatical Error Correction (GEC). Accepted to ACL'25 Demo: "gec-metrics: A Unified Library for Grammatical …☆14Jan 25, 2026Updated last month
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Nanyang Technological University - Multilingual Corpus (STB subcorpora)☆12Mar 11, 2019Updated 7 years ago
- ☆23Feb 4, 2020Updated 6 years ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 8 months ago