Program used to split text into segments
☆28Oct 27, 2024Updated last year
Alternatives and similar repositories for segment
Users that are interested in segment are comparing it to the libraries listed below
Sorting:
- Tool to fix bitexts and tag near-duplicates for removal☆34Sep 4, 2025Updated 6 months ago
- Tool for manual evaluation of parallel sentences.☆15Jan 26, 2026Updated last month
- Corset is a web-based data selection portal that helps you getting relevant data from massive amounts of parallel data.☆21Nov 6, 2023Updated 2 years ago
- Transform TMX to text☆28Nov 23, 2022Updated 3 years ago
- Targetted language identifier, based on FastText and Hunspell.☆38Sep 4, 2025Updated 6 months ago
- This library provides an API to interface with Hunspell using BridJ.☆23Feb 11, 2023Updated 3 years ago
- 用深度神经网络识别语篇关系的模型,主要结合了TreeLSTM和NTN两种神经网络,用TreeLSTM来获得句子向量,NTN来识别两个句子向量之间的关系.☆14Mar 25, 2016Updated 9 years ago
- Scripts that were used for preparing and converting the Wikipedia documents that are part of the CLIN28 shared task on spelling correctio…☆10Jan 20, 2018Updated 8 years ago
- Simple example how to convert an PyTorch model into Tensorflow using ONNX.☆10Jul 5, 2020Updated 5 years ago
- An off-the-shelf client-side language identification module for JavaScript.☆16Jul 17, 2014Updated 11 years ago
- ☆16Jun 10, 2021Updated 4 years ago
- Code and data for EMNLP2016 article "What makes a convincing argument? Empirical analysis and detecting attributes of convincingness in W…☆13Nov 9, 2016Updated 9 years ago
- python package for unsupervised text segmentation.☆14Oct 31, 2016Updated 9 years ago
- A recurrent neural network heavily inspired by Long Short Term Memory, but simpler.☆21May 4, 2013Updated 12 years ago
- A calendar of your entire life☆20Jan 26, 2026Updated last month
- Gather information about the data availability of lidar data (mainly airborne laserscanning, ALS) in Germany, Europe and worldwide.☆17Apr 3, 2025Updated 11 months ago
- ☆15Mar 28, 2022Updated 3 years ago
- Catalan bert model☆13Oct 17, 2020Updated 5 years ago
- ☆18Sep 13, 2015Updated 10 years ago
- ☆12Feb 23, 2023Updated 3 years ago
- An sbt plugin for adding sounds to task completions☆28May 5, 2018Updated 7 years ago
- A tiny command-line translator was implemented in Plumbum. Youdao.com + Google Translation + Gemini + OpenAI☆10Apr 24, 2025Updated 10 months ago
- SANNet Neural Network Framework☆20Feb 19, 2024Updated 2 years ago
- Modernized version of Eric Brill's Part Of Speech tagger.☆15May 6, 2025Updated 10 months ago
- An Android dictionary application with support for mdx format.☆11Jan 7, 2023Updated 3 years ago
- Lightweight storage for Trino views☆17Feb 10, 2026Updated last month
- ☆21Aug 3, 2023Updated 2 years ago
- An Offline and Secure Retrieval-Augmented Generation (RAG) system designed for efficient processing of diverse content types with minimal…☆20Dec 29, 2024Updated last year
- Parses Polish wiktionary and creates simple dictionaries of foreign languages (e.g. English) to Polish and vice versa.☆16Jul 22, 2013Updated 12 years ago
- A simple project that trains an OpenNLP Named Entity Recognition model to identify ingredients in a recipe.☆14Oct 30, 2016Updated 9 years ago
- Schematron based JSON Semantic Validator☆18Jan 5, 2020Updated 6 years ago
- SymSpell v6.4ish ported to Java 8. Will be a module in my Master Thesis.☆24Jul 18, 2019Updated 6 years ago
- ☆15Apr 18, 2018Updated 7 years ago
- EMNLP-18☆17Dec 21, 2021Updated 4 years ago
- Cybersecurity for the mortals.☆20Updated this week
- This is the open source version of the documentation for AWS Marketplace for Sellers. You can submit feedback & requests for changes by s…☆16Jun 15, 2023Updated 2 years ago
- Brave is a simple visualisation library for NLP information extraction, built on top of embedded BRAT.☆15Dec 25, 2019Updated 6 years ago
- VS Code extension to wrap the LanguageTool API☆27Dec 11, 2018Updated 7 years ago
- IngestRSS is an AWS-based RSS feed processing system that automatically fetches, processes, and stores articles from specified RSS feeds.…☆16Dec 22, 2024Updated last year