Gale-Church sentence aligner with options for variable parameters
☆18Oct 7, 2019Updated 6 years ago
Alternatives and similar repositories for gachalign
Users who are interested in gachalign are comparing it to the libraries listed below.
- Bilingual sentence aligner☆29Nov 25, 2025Updated 6 months ago
- Nanyang Technological University - Multilingual Corpus (STB subcorpora)☆12Mar 11, 2019Updated 7 years ago
- Entitypedia is an Extended Named Entity Dictionary from Wikipedia.☆13Dec 7, 2022Updated 3 years ago
- Simple tool for generating tokens with open source transformers and/or calculate per-token surprisal.☆14Apr 15, 2026Updated last month
- ☆15Mar 10, 2026Updated 3 months ago
- WMT-2012 shared task on Quality Estimation☆18Sep 5, 2012Updated 13 years ago
- Gale&Church (1993) sentence alignment☆16May 9, 2020Updated 6 years ago
- Code for our paper in ACL 2017☆13Dec 14, 2017Updated 8 years ago
- ☆12Dec 9, 2015Updated 10 years ago
- NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021☆13May 18, 2021Updated 5 years ago
- ☆18Jul 21, 2023Updated 2 years ago
- Making Toronto City Council inclusive and accessible - by us, for us, for free☆12Mar 5, 2025Updated last year
- Arduino C++ library to send telemetry and receive channel data to/from Jeti Duplex receivers via EX Bus.☆16Sep 17, 2023Updated 2 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆314Mar 18, 2021Updated 5 years ago
- This dataset contains human judgements about answer equivalence. The data is based on SQuAD (Stanford Question Answering Dataset), and co…☆28Oct 24, 2022Updated 3 years ago
- Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation"☆11Jul 13, 2017Updated 8 years ago
- A tool for extracting plain text from Wikipedia dumps☆15Sep 13, 2018Updated 7 years ago
- ☆23Oct 1, 2021Updated 4 years ago
- ☆13May 17, 2024Updated 2 years ago
- Split argv(argument vector) and handle special cases.☆10Nov 19, 2024Updated last year
- React Native JS Utils for DetoxInstruments☆13Mar 19, 2023Updated 3 years ago
- A framework for quick web archiving; canonical repository: https://gitea.arpa.li/JustAnotherArchivist/qwarc☆31Jan 17, 2026Updated 4 months ago
- Javascript directed acyclic word graph (DAWG)☆14Apr 30, 2016Updated 10 years ago
- Pytorch Text GAN for lyrics generation☆10Apr 13, 2019Updated 7 years ago
- Russian coreference resolution made as simple and accessible as could be☆11Sep 3, 2022Updated 3 years ago
- ☆18Dec 13, 2021Updated 4 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- An Arduino-based flight control system for radio-controlled aircraft☆22Aug 20, 2020Updated 5 years ago
- Representation Learning of Entities and Documents from Knowledge Base Descriptions☆18Oct 6, 2018Updated 7 years ago
- Docker container for UDPipe (https://github.com/ufal/udpipe) REST server.☆12Jun 23, 2020Updated 5 years ago
- A Python script to convert vobsub subtitles into srt format using tesseract for ocr☆10Sep 28, 2014Updated 11 years ago
- Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. T…☆32Jul 16, 2022Updated 3 years ago
- A common methodology for measuring the accuracy of real-time ETA predictions☆27Feb 27, 2025Updated last year
- A Gherkin parser and Cucumber-like implementation for JavaScript☆14Oct 28, 2019Updated 6 years ago
- Repository for the atmaCup #11 Public 4th / Private 5th solution.☆12Aug 3, 2021Updated 4 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- Japanese LLaMa experiment☆54Dec 27, 2025Updated 5 months ago
- Simple CORPORA list crawler☆11Dec 2, 2016Updated 9 years ago
- A simple utility to index wikipedia dumps using Lucene.☆21Oct 13, 2020Updated 5 years ago