An accurate multilingual word aligner based on LaBSE
☆24Oct 25, 2023Updated 2 years ago
Alternatives and similar repositories for AccAlign
Users that are interested in AccAlign are comparing it to the libraries listed below.
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Feb 12, 2023Updated 3 years ago
- Yet another Python binding for Juman++/KNP/KWJA☆40Jun 21, 2026Updated last week
- NanGe - A Rule-based Chinese-English Machine Translation System☆20Jul 23, 2017Updated 8 years ago
- ☆13Apr 13, 2021Updated 5 years ago
- ☆57Dec 27, 2025Updated 6 months ago
- A neural word aligner based on multilingual BERT☆376Mar 10, 2022Updated 4 years ago
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP☆15Mar 24, 2021Updated 5 years ago
- GLADIS: A General and Large Acronym Disambiguation Benchmark (EACL 23)☆18Jun 24, 2024Updated 2 years ago
- Repository of ACL2023 paper: Unbalanced Optimal Transport for Unbalanced Word Alignment☆38Sep 13, 2023Updated 2 years ago
- ☆10Oct 17, 2021Updated 4 years ago
- [ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated last year
- Library for experimenting with state-of-the-art evaluation metrics like UScore☆12May 27, 2023Updated 3 years ago
- A centrality-based Chinese keyphrase extraction tool☆11Sep 2, 2022Updated 3 years ago
- ☆12Dec 13, 2022Updated 3 years ago
- [WIP] A TSX-like language that's safer, more functional, and compiles to JSX.☆12Jun 19, 2025Updated last year
- Swete's LXX Text from 1KY Greek with Corrections Against Manuscripts☆10Oct 11, 2020Updated 5 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆57Apr 2, 2023Updated 3 years ago
- Reagent interface to the Mafs interactive 2d math visualization library.☆15Jun 1, 2024Updated 2 years ago
- Official code implementation of "Tree-based Focused Web Crawling with Reinforcement Learning" and the TRES framework☆24Feb 16, 2026Updated 4 months ago
- KG data for ODA☆12May 14, 2026Updated last month
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- A collection of writings from historical Christianity, browse at https://historicalchristian.faith/by_father.php☆17Jun 20, 2026Updated last week
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- Can LLMs generate code-mixed sentences through zero-shot prompting?☆11Apr 18, 2023Updated 3 years ago
- ☆15Oct 5, 2025Updated 8 months ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated 2 years ago
- TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes☆14Jul 1, 2025Updated last year
- Scripts to preprocess training and test data and to run fast_align and giza☆107Nov 2, 2021Updated 4 years ago
- Docker image for Cloudflare workerd☆15Feb 11, 2023Updated 3 years ago
- ☆16Jul 17, 2025Updated 11 months ago
- Benchmarking framework for Clojure☆10Feb 27, 2019Updated 7 years ago
- A library for evaluation of Grammatical Error Correction (GEC). Accepted to ACL'25 Demo: "gec-metrics: A Unified Library for Grammatical …☆14Jan 25, 2026Updated 5 months ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Nanyang Technological University - Multilingual Corpus (STB subcorpora)☆12Mar 11, 2019Updated 7 years ago
- Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear project…☆19Jun 1, 2021Updated 5 years ago
- ☆30May 6, 2026Updated last month
- Data and code to support "Applied Natural Language Processing" (INFO 256, Fall 2023, UC Berkeley)☆17Nov 20, 2023Updated 2 years ago
- ☆10Dec 17, 2020Updated 5 years ago
- Improved Sentence Alignment in Linear Time and Space☆199Mar 6, 2023Updated 3 years ago