Program used to split text into segments
☆28Oct 27, 2024Updated last year
Alternatives and similar repositories for segment
Users that are interested in segment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tool to fix bitexts and tag near-duplicates for removal☆35Sep 4, 2025Updated 9 months ago
- Tool for manual evaluation of parallel sentences.☆15Jan 26, 2026Updated 5 months ago
- Corset is a web-based data selection portal that helps you getting relevant data from massive amounts of parallel data.☆21Nov 6, 2023Updated 2 years ago
- A set of Java filters for creating, merging and validating XLIFF 1.2, 2.0, 2.1 and 2.2 files.☆85Jun 24, 2026Updated last week
- Transform TMX to text☆27Nov 23, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Targetted language identifier, based on FastText and Hunspell.☆38Sep 4, 2025Updated 9 months ago
- Hunspell library for Java based on JNA☆63Mar 6, 2023Updated 3 years ago
- 用深度神经网络识别语篇关系的模型,主要结合了TreeLSTM和NTN两种神经网络,用TreeLSTM来获得句子向量,NTN来识别两个句子向量之间的关系.☆14Mar 25, 2016Updated 10 years ago
- TH4J- A wrapper of torch TH library for Java (JVM langauges).☆11Nov 4, 2015Updated 10 years ago
- Scripts that were used for preparing and converting the Wikipedia documents that are part of the CLIN28 shared task on spelling correctio…☆10Jan 20, 2018Updated 8 years ago
- Simple example how to convert an PyTorch model into Tensorflow using ONNX.☆10Jul 5, 2020Updated 5 years ago
- ☆11Jun 3, 2019Updated 7 years ago
- ☆16Jun 10, 2021Updated 5 years ago
- Code and data for EMNLP2016 article "What makes a convincing argument? Empirical analysis and detecting attributes of convincingness in W…☆13Nov 9, 2016Updated 9 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A recurrent neural network heavily inspired by Long Short Term Memory, but simpler.☆21May 4, 2013Updated 13 years ago
- Integration of Languagetool into the Atom text editor.☆18Feb 18, 2022Updated 4 years ago
- A calendar of your entire life☆20Jan 26, 2026Updated 5 months ago
- Gather information about the data availability of lidar data (mainly airborne laserscanning, ALS) in Germany, Europe and worldwide.☆17Apr 3, 2025Updated last year
- SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm☆17Jul 7, 2015Updated 10 years ago
- ☆18Sep 13, 2015Updated 10 years ago
- An sbt plugin for adding sounds to task completions☆28May 5, 2018Updated 8 years ago
- SANNet Neural Network Framework☆20Feb 19, 2024Updated 2 years ago
- Codee: An efficient AI programming assistant☆17Mar 30, 2026Updated 3 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Lightweight storage for Trino views☆18Feb 10, 2026Updated 4 months ago
- Java library for running and visualizing Deep Neural Networks trained in Torch.☆26Mar 22, 2025Updated last year
- Convert custom sql like dsl query to es query☆12Feb 27, 2020Updated 6 years ago
- Best Practices in Translation Memory Management☆47Dec 14, 2018Updated 7 years ago
- A Cross-Domain Transferable Neural Coherence Model https://arxiv.org/abs/1905.11912☆24Jul 8, 2020Updated 5 years ago
- ☆12Jul 19, 2018Updated 7 years ago
- A simple project that trains an OpenNLP Named Entity Recognition model to identify ingredients in a recipe.☆14Oct 30, 2016Updated 9 years ago
- ☆15Apr 18, 2018Updated 8 years ago
- Support library for Collaborative Realtime Editing via Operational Transformation☆10Feb 23, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- JSON schema validator☆12Jun 13, 2018Updated 8 years ago
- provide preprocessing platform for Lucene indexing and comprehensive Learning-to-Rank modules☆13Feb 16, 2018Updated 8 years ago
- A graphical diff & merge tool for TMX files☆21Jan 4, 2024Updated 2 years ago
- EMNLP-18☆17Dec 21, 2021Updated 4 years ago
- Plugin for GoCD server that will spin up and shut down EC2 instances as its agent workers on demand☆14Apr 20, 2026Updated 2 months ago
- Brave is a simple visualisation library for NLP information extraction, built on top of embedded BRAT.☆15Dec 25, 2019Updated 6 years ago
- IngestRSS is an AWS-based RSS feed processing system that automatically fetches, processes, and stores articles from specified RSS feeds.…☆18Dec 22, 2024Updated last year