microsoft / factored-segmenterLinks
Unsupervised factor-based text tokenizer for natural-language processing applications
☆17Updated 5 years ago
Alternatives and similar repositories for factored-segmenter
Users that are interested in factored-segmenter are comparing it to the libraries listed below
Sorting:
- Port of PragmaticSegmenter for sentence boundary detection☆38Updated 3 years ago
- Machine is a natural language processing library for .NET that is focused on providing tools for processing resource-poor languages.☆28Updated this week
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 5 years ago
- Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block…☆28Updated 5 years ago
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).☆35Updated 3 years ago
- .Net Implementation for google word2vec tools.☆37Updated 2 years ago
- Fast and memory-efficient library for WordPiece tokenization as it is used by BERT.☆50Updated last week
- Pure C# library to convert between document formats (Office 97-2003, Open XML, RTF, Markdown)☆20Updated this week
- Open source project for BERT Tokenizers in C#.☆87Updated 2 years ago
- ☆18Updated 2 years ago
- Palaso Library: A set of .Net libraries useful for developers of Language Software.☆43Updated this week
- Research EXpression Language☆24Updated last year
- Search Photo based in content of the photo using ONNX model and ML.Net☆13Updated 6 years ago
- A phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to charac…☆161Updated 2 years ago
- SpacyDotNet is a .NET wrapper for the popular natural language library spaCy☆35Updated 3 months ago
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆122Updated 5 years ago
- Numpy style scientific programming package☆18Updated 6 years ago
- International Components for Unicode for .NET☆37Updated 3 months ago
- It is a WIP C# .Net Framework implementation of the original markitdown Python library.☆15Updated 5 months ago
- C# Sequence to Sequence Learning with Attention using LSTM neural Networks☆26Updated 6 months ago
- A simplified version of the JsonAnalyzer. Great for validating local JSON files or when you already have JSON in a string.☆14Updated 9 months ago
- A lightweight cross platform GUI for Docker build with Avalonia UI.☆16Updated 4 years ago
- .NET Standard interface for the WebRTC voice activity detection (VAD) component.☆54Updated 3 years ago
- .NET Core Proxy library based on HttpClient works with FreeProxyList.net☆20Updated 2 years ago
- Simple application to full-text searching in file system☆18Updated 4 years ago
- Interface abstraction for accessing environment variables in .NET☆18Updated 3 weeks ago
- .NET Standard wrapper for fastText library. Now works on Windows, Linux and MacOs!☆76Updated 11 months ago
- A docx renderer allows outputing Markdown-formatted text into Microsoft Word .docx documents☆19Updated last year
- WPF/MVVM Xml Viewer for large Xml files (Alpha Version)☆17Updated 6 years ago
- Easily add searching to your apps☆14Updated last week